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GLP-1 F0SION PROTEINS 



The present invention relates to glucagon -like peptides 
including analogs and derivatives thereof fused to proteins 
5 that have the effect of extending the in vivo half -life of 
the peptides. These fusion proteins can be used to treat 
non-insulin dependent diabetes mellitus as well as a variety 
of other conditions. 

10 Glucagon- Like Peptide 1 {GLP-1} is a 37 amino acid 

peptide that is secreted by the L-cells of the intestine in 
response to food ingestion. It has been found to stimulate 
insulin secretion (insul inotropic action) , thereby causing 
glucose uptake by cells and decreased serum glucose levels 

15 [see, e.g., Mojsov, S., (1992) Int. J. Peptide Protein 

Research, 40:333-343]. However, GLP-1 is poorly active. A 
subsequent endogenous cleavage between the 6^ and 7 th 
position produces a more potent biologically active GLP-1 (7- 
37) OH peptide. Numerous GLP-1 analogs and derivatives are 

20 known and are referred to herein as "GLP-1 compounds." 

These GLP-1 analogs include the Exendins which are peptides 
found in the venom of the GILA-mons ter . The Exandins have 
sequence homology to native GLP-1 and can bind the GLP-1 
receptor and initiate the signal transduction cascade 

25 responsible for the numerous activities that have been 
attributed to GLP-1 (7-37) OH. 

GLP-1 compounds have a variety of physiologically 
significant activities. For example, GLP-1 has been shown 
to stimulate insulin release, lower glucagon secretion, 

30 inhibit gastric emptying, and enhance glucose utilisation. 
[Nauck, M.A. , efc al. U993) Diabetologia 36:741-744; 
Gutniak, M. , et al. (1992) New England J. of Med. 326:1316- 
1322; Mauek, M.A. , efc al., (1993) J. Clin, invest. 91:301- 
307] . 
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GLP-1 shows the greatest promise as a treatment for 
noil- insulin dependent diabetes msllitus {NIDDM} . There are 
numerous oral drugs on the market to treat the insulin 
resistance associated with NIDDM, As the disease 
5 progresses, however, patients must move to treatments that 
stimulate the release of insulin and eventually to 
treatments that involve injections of insulin. Current 
drugs which stimulate the release of insulin, however, can 
also cause hypoglycemia as can the actual administration of 

10 insulin. GLP-1 activity, however, is controlled by blood 
glucose levels. When levels drop to a certain threshold . 
level, GLP-1 is not active. Thus, there is no risk of 
hypoglycemia associated with treatment involving GLP-1. 
However, the usefulness of therapy involving GLP-1 

15 peptides has been limited by their fast clearance and short 
half -lives. For example, GLP-1 (7-37) has a serum half -life 
of only 3 to 5 minutes. GLP-1 {7-36} amide has a time action 
of about 50 minutes when administered suhcutaaeously . Even 
analogs and derivatives that are resistant to endogenous 

20 protease cleavage, do not have half- lives long enough to 

avoid repeated administrations over a 24 hour period. Fast 
clearance of a therapeutic agent is inconvenient in cases 
where it is desired to maintain a high blood level of the 
agent over a prolonged period of time since repeated 

2 5 administrations will then be necessary. Furthermore, a 

long-acting compound is particularly important for diabetic 
patients whose past treatment regimen has involved taking 
only oral medication. These patients often have an 
extremely difficult time transitioning to a regimen that 

30 involves multiple injections of medication. 

The present invention overcomes the problems associated 
with delivering a compound that has a short plasma half- 
life. The compounds of the present invention encompass GLP- 
1 compounds fused to another protein with a long circulating 
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half -life such as the Pc portion of an immunoglobulin or 
albumin. 

Generally, small therapeutic peptides are difficult to 
manipulate because even slight changes in their structure 
5 can affect stability and/or biological activity. This has 
been especially true for GLP-1 compounds currently in 
development. For example, GLP-1 (7-37} OH has a tendency to 
undergo a conformational change from a primarily alpha helix 
structure to a primarily beta sheet structure. This beta 

ID sheet form results in aggregated material that is thought to 
be inactive, it was, therefore, surprising that 
biologically active GLP-1 fusion proteins with increased 
half-lives could be developed. This was especially 
unexpected given the difficulty of working with GLP-1 <7- 

15 37) OH alone and the large size of the fusion partner 
relative to the small GLP-1 peptide attached. 

Compounds of the present invention include heterologous 
fusion proteins comprising a first polypeptide with a 33- 
20 terminus and a c- terminus fused to a second polypeptide with 
a N~ terminus and a C- terminus wherein the first polypeptide 
is a GLP-1 compound and the second polypeptide is selected 
from the group consisting of 
a} human albumin? 
25 fo) human albumin analogs; and 

c) fragments of human albumin, 
and wherein the C- terminus of the first polypeptide is fused 
to the H- terminus of the second polypeptide. 

Compounds of the present invention also include a 
30 heterologous fusion protein comprising a first 

polypeptide with a N- terminus and a C- terminus fused to a 
second polypeptide with a H- terminus and a C~ terminus 
wherein the first polypeptide is a GLP-1 compound and the 
second polypeptide is selected from the group consisting of 
35 a) human albumin; 
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h) human albumin analogs; and 

c) fragments of human albumin, 
and wherein the C- terminus of the first polypeptide is fused 
to the N~ terminus of the second polypeptide via a peptide 
5 linker. It is preferred that the peptide linker is selected 
from the group consisting of: 

a} a glycine rich peptide; 

b) a peptide having the sequence [Gly-Gly-Gly-Gly-Ser] a 
where n is 1, 2, 3, 4, 5 or 6; and 
10 c) a peptide having the sequence [Gly-Gly-Gly-Gly- 

Ser] 3 , 

Additional compounds of the present invention include a 
heterologous fusion protein comprising a first polypeptide 
with a N- terminus and a C- terminus fused to a second 
15 polypeptide with a N~ terminus and a C- terminus wherein the 
first polypeptide is a GLP-1 compound and the second 
polypeptide is selected from the group consisting of 

a) the .Fe portion of an immunoglobulin; 

b) an analog of the Fc portion, of an immunoglobulin; 
20 and 

c} fragments of the Fc portion of an immunoglobulin, 
and wherein the C- terminus of the first polypeptide is fused 
to the N- terminus of the second polypeptide. The GLP-1 
compound may be fused to the second polypeptide via a 
25 peptide linker. It is preferable that the peptide linker is 
selected from the group consisting of: 

a) a glycine rich peptide; 

h) a peptide having the sequence [Gly-Gly-Gly-Gly-Ser] a 
where n is 1, 2, 3, 4, 5 or 6; and 
30 c} a peptide having the sequence [Gly-Gly-Gly-Gly-Ser] 3 . 

It is generally preferred that the GLP-1 compound that 
is part of the heterologous fusion protein have no more than 
6 amino acids that are different from the corresponding 
amino acid in GLP-1 (7-37) OH, GLP-1 {7-36 ) OH, or Sxendin-4 . 
35 It is even more preferred that the GLP-1 compound have no 
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more than 5 amino acids that differ from the corresponding 
amino acid in GLF-1 (7-37) OH, GLP-1 (7-36) OH, or Exendin-4. 
It is most preferred that the GLP-1 compound have no more 
than 4,. 3, or 2 araino acids that differ from the 
5 corresponding amino acid in GLP-1 (7-37) OH, GLP-1 {7-36} OH, or 
Exendin-4. Preferably, a GLP-1 compound that is part of the 
heterologous fusion protein has glycine or valine at 
position 8. 

The present invention also includes polynucleotides 
10 encoding the heterologous fusion protein described herein, 
vectors comprising these polynucleotides and host cells 
transfected or transformed with the vectors described 
herein. Also included is a process for producing a 
heterologous fusion protein comprising the steps of 
15 transcribing and translating a polynucleotide described 
herein under conditions wherein the heterolgous fusion 
protein is expressed in detectable amounts. 

The present invention also encompasses a method for 
normalizing blood glucose levels in a mammal in need thereof 
20 comprising the administration of a therapeutically effective 
amount of a heterologous fusion protein described herein. 

The invention is further illustrated with reference to 
the following drawings: 
25 Figure 1: igGl Fc amino acid sequence encompassing the 

hinge region, CH2 and CH3 domains. 

Figure 2: Human serum albumin amino acid sequence. 
Figure 3: A. SDS-PAGE gel and immunoblot of same gel 
illustrating the molecular weight of IgGl-Fc and GLP-1 -Fc 
30 fusion proteins (Lane 1, MW standards; Lane 2, Purified Fc; 
lane 3, Mock transfected media; Lane 4, Val 8 -GLP-l~Fc ; Lane 
5, Exendin-4 -Fc} B . SDS-PAGE gel and immunoblot of same gel 
illustrating the molecular weight of human HSA and GLP-1-HSA 
fusion proteins {Lane 1, MW standards; Lane 2, Purified HSA; 
35 lane 3, Mock transfected media; Lane 4, ¥al 8 -GLP- 1 -HSA ; Lane 
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5, Val 8 -GLP-1- [Gly-Gly-Gly-Gly-Ser] 3 ~HSA; Bane 6, Exendin-4- 
HSA; Lane 7 , Exendin~4~ [Gly-Gly-Gly~Gly~Ser] j-HSA) . 

Figure 4: SDS-PAGE gel of purified Fc, albumin, and 
GLP-1 fusion proteins (Lane 1, MW standards; Lane 2, 
5 purified Fc; Lane 3, Val8-GLP-1-Fc; Lane 4, Exendin-4-Fc ; 
Lane 5, KW standard; Lane 6, Val8~GLP-l-HSA; Lane 7, 
Exendin-4-HSA; Lane 8, Exendin-4- [Gly-Gly-Gly-Gly-Ser] 3 - 
HSA) . 

Figure 5: Expression cloning vector containing the Fc 
10 regions illustrated in figure 1. 

Figure 6: Expression cloning vector containing the 
albumin sequence illustrated in figure 2. 

Figure 7: Expression cloning vector containing DSJA 
encoding a 15 amino acid linker fused in frame and 5' of the 
15 albumin sequence illustrated in figure 2 . 

Figure 8: In vitro dose response activity of GLP-1 
fusion proteins. 

Figure 9; Pharmacokinetics of GLP-1 Fc and HSA. fusion 
proteins , 

20 Figure 10; Glucodynaroic response to Exendin-Fc in two 

normal fasted dogs. 

Figure 11: Insu lino tropic response to Exendin-Fc in two 
normal fasted dogs. 

Figure 12 : UNA sequence encoding a human XgGl Fc 
25 region. 

Figure 13 : DKA sequence encoding a human albumin 
protein. 

The heterologous fusion proteins of the present 
30 invention comprise a GLP-1 compound fused to human albumin, 
a human albumin analog, a human albumin fragment , ' the Fc 
portion of an immunoglobulin, an analog of the Fc portion of 
an immunoglobul in , or a fragment of the Fc portion of an 
immunoglobulin. The C- terminus of the GLP-1 compound may be 
35 fused directly, or fused via a peptide linker, to the N- 
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terminus of an albumin or Fc protein. These heterologous 
fusion proteins are biologically active artel have an 
increased half-life compared to native GLP-1. 

it is preferred that the GLP-1 compounds that make up 
S part of the heterologous fusion protein encompass 

polypeptides having from about twenty- five to about thirty- 
nine naturally occurring or ncn- naturally occurring amino 
acids that have sufficient homology to native GLP-1 ( 7-37 }OH 
such that they exhibit insul inotropic activity by binding to 

10 the GLP-1 receptor on {3-eells in the pancreas. A GLP-1 

compound typically comprises a polypeptide having the amino 
acid sequence of GLP-1 (7-37 ) OH, an analog of GLP-1 {7-37 ) OH, 
a fragment of GLP-1 (7-37) OH or a fragment of a GLP-1 {7-37} OH 
analog. GLP-1 (7-37) OH has the amino acid sequence of SEQ ID 

15 NO: 1: 

7 8 9 10 11 12 13 14 15 16 17 
His -Ala-Glu-Gly-Thr-Phe -Thr-Ser-Asp-val-Ser- 
18 19 20 21 22 23 24 25 26 27 28 
Ser-Tyr-Leu-Glu-Gly-Gln-Ala-Ala-Lys-Glu-Phe- 
2 0 29 30 31 32 33 34 35 36 37 

lle-Aia-Trp-Leu-Val-Lys-Gly-Arg-Gly 
{ SEQ ID NO: 1) 

By custom in the art, the amino terminus of GLP-1 (7 ~ 
25 37} OH has been assigned number residue 7 and the carboxy- 
termiiius, number 37. The other amino acids in the 
polypeptide are numbered consecutively, as shown in SEQ 
ID NO: 1. For example, position 12 is phenylalanine and 
position 22 is glycine, 
30 GLP-1 compounds also encompass "GLP-1 fragments , " a 

GLP-1 fragment is a polypeptide obtained after truncation of 
one or more amino acids from the iff- terminus and/or C- 
terminus of GLP-1 {7-37} OK or an analog or derivative 
thereof. The nomenclature used to describe GLP-1 {7-37 ) OH is 
35 also applicable to GLP-1 fragments- For example, GLP-1 (9- 
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36) OK denotes a GLP-1 fragment obtained by truncating two 
amino acids from the M-terminus and one amino acid from the 
C- terminus . The amino acids in the fragment are denoted by 
the same number as the corresponding amino acid in GLP-1 (7- 
5 37) OH. For example, the itf- terminal glutamic acid in GLP~ 
1 (9-36) OH is at position 9; position 12 is occupied by 
phenylalanine; and position 22 is occupied by glycine, as in 
GLP-1 (7-37) OH. For GLP-1 {7-36) OH , the glycine at position 37 
of GLP-1 (7-37) OH is deleted. 

10 GLP-1 compounds also include polypeptides in which one 

or more amino acids have been added to the M~ terminus and/ or 
C~terminus of GLP-1 {7-37 )OH, or fragments or analogs 
thereof. It is preferred that GLP-1 compounds of this type 
have up to about thirty-nine amino acids. The amino acids 

15 in the "extended" GLP-1 compound are denoted by the same 
number as the corresponding amino acid in GLP-1 {7-37) OH. 
For example, the Itf- terminus amino acid of a GLP-I compound 
obtained by adding two amino acids to the if-terminal of GLP- 
1 {7-37 (OH is at position 5; and the C- terminus amino acid of 

20 a GLP-1 compound obtained by adding one amino acid to the C~ 
terminus of GLP-1 (7-37) OH is at position 38. Thus, position 
12 is occupied by phenylalanine and position 22 is occupied 
by glycine in both of these "extended" GLP-1 compounds, as 
in GLP-1 (7-37) OH. Amino acids 1-6 of an extended GLP-1 

25 compound are preferably the same as or a conservative 

substitution of the amino acid at the corresponding position 
of GLP-1 (1-37) OH. Amino acids 38-45 of an extended GLP-1 
compound are preferably the same as or a conservative 
substitution of the amino acid at the corresponding position 

30 of glucagon or Sxendin-4 . 

GLP-1 compounds of the present invention encompass 
"GLP-1 analogs," A GLP-1 analog has sufficient homology to 
GLP-1 (7-37) OH or a fragment of GLP-1 { 7 -37) OH such that the 
analog has insul inotropic activity. Preferably, a GLP-1 

35 analog has the amino acid sequence of GLP-1 (7-37 ) OH or a 
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fragment thereof, modified so that from one, two, three, 
four or five amino acids differ from the amino acid in the 
corresponding position of GLP-1 {7-37 ) OH or a fragment of 
GLP-1 (7 -37) OH- In the nonmenclature used herein to 
5 designate GLP-1 compounds, the substituting amino acid and 
its position is indicated prior to the parent structure, 
For example, Glu 22 -GLP-1 {7-37)OH designates a GLP-1 compound 
in which the glycine normally found at position 22 of QLP™ 
1(7-37) OH has been replaced with glutamic acid? Val 8 -Glu 22 " 

10 GLP-1 {7-37} OH designates a GLP-1 compound in which alanine 
normally found at position 8 and glycine normally found at 
position 22 of GLP-1 {7-37} OH have been replaced with valine 
and glutamic acid., respectively. 

GLP-1 compounds of the present invention also include 

15 "GLP-1 derivatives." A GLP-1 derivative is defined as a 
molecule having the amino acid sequence of GLP-1 or of a 
GLP-1 analog, but additionally having chemical modification 
of one or more of its amino acid side groups, a- carbon 
atoms, terminal amino group, or terminal carboxylic acid 

20 group. A chemical modification includes, but is not limited 
to, adding chemical moieties, creating new bonds, and 
removing chemical moieties- Modifications at amino acid 
side groups include, without limitation, acylation of lysine 
E-amino groups, N- alky lat ion of arginine, histidine, or 

25 lysine, alkylation of glutamic or aspartic carboxylic acid 
groups, and dearaidation of glut amine or asparagine . 
Modifications of the terminal amino group include, without 
.limitation, the des-amino, 38- lower alkyl., M-di -lower aifcyl, 
and M-acyl modifications. Modifications of the terminal 

30 carboxy group include, without limitation, the amide, lower 
alkyl amide, dialkyl amide, and lower alkyl ester 
modifications. Lower alkyl is C3.-C4 alkyl. Furthermore, 
cue or more side groups, or terminal groups, may foe 
protected by protective groups known to the ordinarily- 
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- 10' - 

skilled protein chemist. The a-carbon of an amino acid may 
be mono- or ctimethy lated . 

Any GLP-1 compound can be part of the heterologous 
fusion proteins of the present invention as long as the GLP- 
5 1 compound itself is able to bind and induce signaling 
through the GLP-i receptor. GLP-1 receptor binding and 
signal transduction can be assessed, using in vitro assays 
such as those described in EP 519,322 and U.S. Patent No. 
5,120,712, respectively. 

10 Numerous active GLP-1 fragments, analogs and 

derivatives are known in the art and any of these analogs 
and derivatives can also be part of the heterologous fusion 
proteins of the present invention. Some examples of novel 
GLP-1 analogs as well as GLP-1 analogs and derivatives known 

15 in the art are provided herein. 

Some GLP-1 analogs and GLP-1 fragments known in the art 
include, for example, GLP-1 (7-34) and GLP-1 (7-35 ) , GLP-1 (7- 
36), Gln 9 -GLP-l{7-3?) , D-Gln^ -GLP-1 (7-37) „ Thr^-Lys^-GLP- 
1(7-37) , and Lys 18 -GLP-i (7-37) . GLP-1 analogs such as GLP- 

20 1(7-34) and GLP-i (7-35) are disclosed in 5J. S. Patent No . 

5,118,666. Biologically processed forms of GLP-1 which have 
insulino tropic properties, such as GLP-1 (7-36) are also 
known. Other known biologically active GLP-1 compounds are 
disclosed in U.S. Patent No 5,977,071 to Hoffmann, et alt, 

25 U.S. Patent No. 5,545,618 to Buckley, et al . , and Adalhorst, 
et al., J. Biol, Chem. 269:6275 (1994). 
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A preferred group of GLP--1 analogs is composed of 
GLP-1 analogs of formula I <SEQ ID NO: 2 5 

7 8 9 10 11 12 13 14 15 IS 17 
His-Xaa-Xaa~Gly-Xaa~Ph.e~Th.r~Xaa~Asp"Xaa~Xaa~ 
18 19 20 21 22 23 24 25 26 27 28 
Xa a -Xa a -Xaa -Xaa~Xaa-Xaa -Xaa -Xaa - Xaa -Xaa ~ Phe - 
29 30 31 32 33 34 35 36 37 38 39 
lle~Xaa-Xaa~Xaa-Xaa-Xaa-Xaa-Xaa~Xaa-Xaa~Xaa- 
40 41 42 43 44 45 
Xaa-Xaa-Xaa-Xaa-Xaa-Xaa 

Formula I {SSQ ID KfO: 2} 

wherein : 

Xaa at position 8 is Ala, Gly, Ser, Thr, Leu, lie, Val, 

Glu, Asp, or Lys; 
Xaa at position 9 is Glu, Asp, or Lys; 

Xaa at position 11 is Thr,. Ala, Gly, Ser, Leu, lie, Val, 

Glu, Asp, or Lys; 
Xaa at position 14 is Ser, Ala, Gly, Thr, Leu, lie, Val, 

Glu, Asp, or Lys; 
Xaa at position 16 is Val, Ala, Gly, Ser, Thr, Leu, He, 

Tyr, Glu, Asp, Trp, or Lys; 
Xaa at position 17 is Ser, Ala, Gly, Thr, Leu, lie, Val, 

Glu, Asp, or Lys; 
Xaa at position 18 is Ser, Ala, Gly, Thr, Leu, He, Val, 

Glu, Asp, Trp, Tyr, or Lys; 
Xaa at position 19 is Tyr, Phe, Trp, Glu, Asp, Gin, or Lys; 
Xaa at position 20 is Leu, Ala, Gly, Ser, Thr, lie, Val, 

Glu, Asp, Met, Trp, Tyr, or Lys; 
Xaa at position 21 is Glu, Asp, or Lys; 

Xaa at position 22 is Gly, Ala, Ser, Thr, Leu, He, Val, 

Glu, Asp, or Lys; 
Xaa at position 23 is Gin, Asn, Arg, Glu, Asp, or Lys; 
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Xaa at position 24 is Ala, Gly, Ser, Thr, Leu, lie, Val, 

Arg, Glu, Asp, or Lys ; 
Xaa at position 25 is Ala, Gly, Ser, Thr, Leu, lie, Val, 

Glu, Asp, or Lys; 
5 xaa at position 26 is Lys, Arg, Gin, Glu. Asp, or His; 
Xaa at position 27 is Leu., Glu, Asp, or Lys; 
Xaa at position 30 is Ala, Gly, Ser, Thr, Leu, lie, Val, 

Glu, Asp, or Lys; 
Xaa at position 31 is Trp, Pbe, Tyr, Glu, Asp, or Lys; 
10 Xaa at position 32 is Leu, Gly, Ala, Ser, Thr, lie, Val, 

Giu, Asp. or Lys; 
Xaa at position 33 is Val, Gly, Ala, Ser, Thr, Leu, He, 

Giu, Asp, or Lys; 
Xaa at position 33 is Asn, Lys, Arg, Glu, Asp, or His; 
15 Xaa at position 35 is Gly, Ala, Ser, Thr, Leu, He, Val, 

Glu, Asp, or Lys; 
Xaa at position 3 6 is Gly, Arg, Lys, Giu, Asp, or His; 
Xaa at position 37 is Pro, Gly, Ala, Ser, Thr, Leu, lie, 

Val, Giu, Asp, or Lys, or is deleted; 
20 Xaa at position 38 is Ser, Arg, Lys, Giu, Asp, or His, or is 

deleted; 

Xaa at position 39 is Ser, Arg, Lys, Giu, Asp, or His, or is 
deleted; 

Xaa at position 40 is Gly, Asp, Glu, or Lys, or is deleted; 
25 Xaa at position 41 is Ala,. Phe, Trp, Tyr, Glu, Asp, or Lys, 
or is deleted; 
Xaa at position 42 is Ser, Pro, Lys, Glu, or Asp, or is 
deleted; 

Xaa at position 43 is Ser, Pro, Giu, Asp, or Lys, or is 
30 deleted; 

Xaa at position 44 is Gly, Pro, Glu, Asp, or Lys, or is 
deleted; 
and 

Xaa at position 45 is Ala, Ser, Val, Glu, Asp,, or Lys, or is 
3 5 deleted; 
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provided that when, the amino acid at position 37, 38, 
39, 40, 41, 42, 43, or 44 is deleted, then each amino acid 
downstream of that amino acid is also deleted. 

It is preferred that the GLP-1 compound of formula I 
5 contain less than six amino acids that differ from the 

corresponding amino acid in GLP-1 (7-37) OH or Exend.in~4. It 
is more preferred that less than five anii.no acids differ 
from the corresponding amino acid in GLP-1 ( 7-37 } OH or 
Sxendin-4 . It is even more preferred that less than four 
10 amino acids differ from the corresponding amino acid in OLE- 
IC 7-37} CH or Excndin 4. 

GLP-1 compounds of the present invention include 
derivatives of formula 2 such as a C- 1-6- ester, or amide, or 
C-l-6-alkylamide ( or C -1-6 -dialkyi amide thereof. WO99/43706 
15 describes derivatives of GLP-1 compounds of formula I and is 
incorporated by reference herein in its entirety. The 
compounds of formula I derivatised as described in 
WO99/43706 and under ivati zed are encompassed by the present 
invention. 

20 Another preferred group of GLP-1 compounds is composed 

of GLP-1 analogs of formula II (SEQ ID WO: 3): 



7 S 9 10 11 12 13 14 15 16 17 
Xa a - Xaa-Xaa-Gly-Xaa -Xaa-Thr-Ser-Asp-Xaa-Ser- 

18 19 20 21 22 23 24 25 26 27 28 
xaa -Xaa- Leu - Glu-Gly- Xaa - Xaa - Ala -Xaa --Xaa- Phe~ 

29 30 31 32 33 34 35 36 37 
Ile~Xaa-Xaa-teu-Xaa-Xaa-Xaa-Xaa-R 
Formula II {SEQ ID $50; 3} 

wherein: 

Xaa at position -7 is: L-histidine, D-histidine, desamino- 
hi s tidine , 2 - amino-histidine , P-hydr-oxy-his tidine , 
homohistidine, a- f luoromethyl-his tidine or a-methyl- 
histidine; 
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Xaa at position 8 is: Gly, Ala, Val, Leu, lie, Ser, or Thr? 
Xaa at position 9 is: Thr, Ser, Arg, Lys, Trp, Phe,. Tyr, 
Glu, or His; 

Xaa at position 11 is: Asp, Glu, Arg, Thr, Ala, Lys, or His; 

Xaa at position 12 is: His, Trp, Phe, or Tyr; 

Xaa at position 16 is: Leu, Ser, Thr, Trp, Bis, Phe, Asp, 

Val, Tyr, Glu, or Ala; 
Xaa at position 18 is: His, Pro, Asp, Glu, Arg, Ser, Ala, or 

Lys ; 

Xaa at position 19 is: Gly, Asp, Glu, Gin, Asn, Lys, Arg, or 
Cys ; 

Xaa at position 23 is: His, Asp, Lys, Glu, Gin, or Arg; 
Xaa at position 24 is: Glu, Arg, Ala, or Lys,- 
Xaa at position 26 is: Txp, Tyr, Phe, Asp, Lys, Glu, or His; 
Xaa at position 27 is: Ala, Glu, His, Phe, Tyr, Trp, Arg, or 
Lys ; 

Xaa at position 30 is: Ala, Glu, Asp, Ser, or His; 
Xaa at position 31 is: Asp, Glu, Ser, Thr, Arg, Trp, or Lys; 
Xaa at position 33 is: Asp, Arg, Val, Lys, Ala, Gly, or Glu; 
Xaa at position 34 is: Glu, Lys, or Asp; 

Xaa at position 35 is: Thr, Ser, Lys, Arg, Trp, Tyr, Phe, 

Asp, Gly, Pro, His, or Glu; 
Xaa at position 36 is: Thr, Ser, Asp, Trp, Tyr, Phe, Arg, 

Glu, or His; 

R at position 37 is: Lys, Arg, Thr, Ser, Glu, Asp, Trp, Tyr, 
Phe, His, Gly, Gly-Pro, or is deleted. 
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Another preferred group of GLP-1 compounds is composed 
of GLP-1 analogs of formula III (SEQ ID NO: 4} : 

7 8 9 10 11 12 13 14 15 16 17 
5 Xaa-Xaa-Glu~Gly-Xaa-Xaa-Thr-Ser-Asp~Xaa-Ser~ 
18 19 20 21 22 23 24 25 26 27 28 
S er-Tyr- Leu -Glu -Xaa -Xaa- Xaa - Xaa - Lys - Xa a - Pile - 
29 30 31 32 33 34 35 36 37 
Ile-Xaa-Trp-Leu-Xaa-Xaa-Xaa-Xaa-R 
10 formula III (SEQ ID NO: 4} 



vshexrein: 

Xaa at position 7 is: L-hisfcidine, D-histidine, desainino- 
Matidine, 2 -amino-h.istid.lne , j3 ~ hydroxy -hi s t i dine , 
15 homohistidine, ct-f luoroxnethyl-histidine or a-methyl~ 

histidine; 

Xaa at position 8 is: Gly, Ala, Val, Leu, lie, Ser, or Thr; 
Xaa at position 11 is; Asp, Glu, Arg, Thr, Ala, Lys, or His; 
Xaa at position 12 is: His, Trp, Phe, or Tyr; 
20 Xaa at position 16 is: Leu, Ser, Thr, Trp, His, Phe, Asp, 

Val, Glu, or Ala; 
Xaa at position 22; Gly, Asp, Glu, Gin, Asn, Lys, Arg, or 

Cys ; 

Xaa at position 23 is: His, Asp, Lys, Glu, or Gin; 
25 Xaa at position 24 is: Glu, His, Ala, or Lys; 
Xaa at position 25 is: Asp, Lys, Glu, or His; 
Xaa at position 27 is: Ala, Glu, His, Phe, Tyr, Trp, Arg, or 
Lys ; 

Xaa at position 30 is; Ala, Glu, Asp, Ser, or His; 
30 Xaa at position 33 is: Asp, Arg, Val, Lys, Ala, Gly, or Glu; 
Xaa at position 34 is; Glu, Lys, or Asp; 

Xaa at position 35 is: Thr, Ser, Lys, Arg, Trp, Tyr, Phe, 

Asp, Gly, Pro, His, or Glu; 
Xaa at position 36 is: Arg, Glu, or His; 
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R at position 37 is: Lys, Arg, Thr, Ser, Glu, Asp, Trp, Tyr, 
Phe, His, Gly, Gly-Pro, or is delated. 

Another preferred group of GLP-1 cORspoimds is composed 
5 of GLP-1 analogs of formula IV {SEQ ID HO: 5): 

7 8 9 10 11 12 13 14 15 16 17 
Xaa -Xaa-Giu-Gly-Thr-Xaa -Thr - Ser - Asp-Xaa-Ser - 
IS 19 20 21 22 23 24 25 26 27 28 
Ser-Tyr~Leu-Glu--Xaa~Xaa-AIa-"Aia~Xaa"-Glu--Phe-- 
10 29 30 31 32 33 34 35 36 37 

lie -Xaa - Trp -Leu -Val -Lys -Xaa - Arg -R 
formula IV (SBQ ID NO; 5) 

whex-ein : 

15 Xaa at position 7 is: L-histidine, D-histidine, desamino- 
his ti, dine, 2-amino-histidine, j3-hydroxy-histidine, 
hoffiohistidine, a-t luororaethyl-histidine or a-methyl- 
histidine; 

Xaa at position 8 is: Gly, Ala, Val, Leu, He, Ser, Met, or 
20 Thr; 

Xaa at position 12 is: His, Trp, Phe, or Tyr; 

Xaa at position 16 is: Leu, Ser, Thr, Trp, His, Phe, Asp, 
Val, Gin, or Ala; 

Xaa at position 22 is: Gly, Asp, Glu, Gin, Asn, Lys, Arg, or 
25 Cys; 

Xaa at position 23 is: His, Asp, Lys, Glu, or Gin; 

Xaa at position 26 is: Asp, Lys, Glu, or His; 

Xaa at position 30 is: Ala, Glu, Asp, Ser, or His; 

Xaa at position 35 is: Thr, Ser, Lys, Arg, Trp, Tyr, Phe, 
30 Asp, Gly, Pro, His, or Glu; 

R at position 37 is: Lys, Arg, Thr, Ser, Glu, Asp, Trp, Tyr, 
Phe, His, Gly, Gly-Pro, or is deleted. 



WO 02/46227 



PCT/li SO J/43165 



Another preferred group of GLP-1 compounds is composed 
of GLP-:. analogs of formula V (SBQ ID NO: 6): 

7 8 9 10 11 12 13 14 15 16 17 
5 Xaa-Xaa-Glu-Sly-Thr-Phe-Thr-Ser-Asp-Val-Ser- 
18 19 20 21 22 23 24 25 26 27 28 
Ser--Tyr~L®u---Glu~Xaa~Xaa-Xaa-Ala~Lys~Glu--Phe-" 
29 30 31 32 33 34 35 3 6 37 
He - Xaa-Trp-Lett - val - Ly s-Gly-Ar g -R 
10 formula V (SBQ ID HO: 6} 

wherein : 

Xaa at position 7 is: L~histidine, I>-hist idine, desamino- 
hi s t i dine , 2 - amino- hi s tidine , $ -hydroxy -hi s t idine , 
homohis tidine, a-fluorcsmethyl-histidine or a-methyl- 
15 his tidine; 

Xaa at position 8 is: Gly, Ala, Val, Leu, lie, Ser, or Thr; 

Xaa at position 22 is: Gly, Asp, Glu, Gin, Asa, Lys, Arg, or 
Cys; 

Xaa at position 23 is: His, Asp, Lys, Glu, or Gin; 
20 Xaa at position 24 is: Ala, Glu, His, Phe, Tyr, Trp, Arg, or 
Lys ; 

Xaa at position 30 is: Ala, Glu, Asp, Ser., or His; 
R at position 37 is: Lys, Arg, Thr, Ser, Glu, Asp. Trp, Tyr, 
Phe, His, Gly, Gly~P.ro, or is deleted. 

25 

Preferred GLP- 1 compounds of formula I, II, III, iv, 
and V comprise GLB-1 analogs or fragments of GLP-1 analogs 
wherein the analogs or fragments contain an amino acid other 
than alanine at position 8 (position 8 analogs) . It is 

30 preferable that these position 8 analogs contain one or more 
additional changes at positions 9, 11, 12, 16, 18, 22, 23, 
24, 26, 27, 30, 31, 33, 34, 35, 36, and 37 compared to the 
corresponding amino acid of native GLP-1 {7-37 } OH. It is 
also preferable that these analogs have 6 or fewer changes 

35 compared to the corresponding amino acids in native GLP-1 {7- 
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37) OH or GLP-1 (7 -36} OH. More preferred analogs have 5 or 
fewer changes compared to the corresponding amino acids in 
native GLP-1 (7-37 ) OH or GLP-1 (7-36) OH or have 4 or fewer 
changes compared to the corresponding amino acids in native 
5 GLP-1 (7-37) OH or GLP-1 (7-36} OH. It is even more preferable 
that these analogs have 3 or fewer changes compared to the 
corresponding amino acids in native GLP-1 (7-37) OH or GLP- 
1(7-36) OH. It is most preferable that these analogs have 2 
or fewer changes compared to the corresponding amino acids 

10 in native GLP-1 (7-37} OH. 

it has been found that the compounds of formula. IX, 
ill, IV, and V have a reduced propensity to aggregate and 
generate insoluble forms. This is also important in the 
context of a fusion protein wherein the relatively small 

15 GLP-1 peptide must maintain an active conformation despite 
being fused to a much larger protein. Preferred GLP-1 
compounds of formula II, III, IV, and V encompassed by the 
fusion proteins of the present invention comprise GLP-1 
analogs or fragments of GLP-1 analogs in which glycine at 

20 position 22 and preferably alanine at position 8 have been 
replaced with another amino acid. 

When position 2 2 is aspartic acid, glutamic acid, 
arginine or lysine, position 3 is preferably glycine, 
valine, leucine, isolecine, serine, threonine or methionine 

25 and more preferably valine or glycine, When position 22 is 
a sulfonic acid such as cysteic acid, position 8 is 
preferably glycine, valine, leucine, isolecine, serine, 
threonine or methionine and more preferably valine or 
glycine . 

30 Other preferred GLP-1 compounds include GLP-1 analogs 

of formula IV SSEQ ID NO: 5) wherein the analogs have the 
sequence of GLP-1 (7-37} OH except that the amino acid at 
position S is preferably glycine, valine, leucine, 
isoleucine, serine, threonine, or methionine and more 

35 preferably valine or glycine and position 30 is glutamic 
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acid, aspartic acid, serine, or histidine and more 
preferably glufcamatic acid. 

Other preferred QW~1 compounds include 6LP-1 analogs 
of formula IV (SEQ ID NO: 5) 'wherein the analogs have the 
5 sequence of GLP~ 1 (7-37} OH except that the amino acid at 
position 8 is preferably glycine, valine, leucine, 
isoleucine, serine, threonine, or methionine and more 
preferably valine or glycine and position 37 is histidine, 
lysine, arginine, threonine, serine, glutamic acid, aspartic 

10 acid, tryptophan, tyrosine, phenylalanine and more 
prefer-ably histidine . 

Other preferred GLP-1 compounds include GLP-1 analogs 
of formula IV {SEQ ID NO; 5} wherein the analogs have the 
sequence of GLP-1 ( 7-37)OH except that the amino acid at 

15 position S is preferably glycine, valine, leucine, 

isoleucine, serine, threonine, or methionine and more 
preferably valine or glycine and position 22 is glutamic 
acid, lysine, aspartic acid, or arginine and more preferably 
glutamic acid or lysine and position 23 is lysine, arginine, 

20 glutamic acid, aspartic acid, and histidine and more 
preferably lysine or glutamic acid. 

Other preferred GLP-1 compounds include GLP-1 analogs 
of formula V (SEQ ID NO: 6) wherein the analogs have the 
sequence of GLP-1 {7-37 ) OH except that the amino acid at 

25 position 8 is preferably glycine, valine, leucine, 

isoleucine, serine, threonine, or methionine and more 
preferably valine or glycine and position 22 is glutamic 
acid, lysine, aspartic acid, or arginine and more preferably 
glutamine acid or lysine and position 27 is alanine, lysine, 

30 arginine, tryptophan, tyrosine, phenylalanine, or histidine 
and more preferably alanine. 

Other preferred GLP-1 compounds include GLP-1 analogs 
of formula II wherein the analogs have the sequence of GLP- 
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1 {7-37} OK except that the amino acid at position 8 and one, 
two, or three amino acids selected from the group consisting 
of position 9, position 11. position 12, position 16, 
position 18, position 22, position 23, position 24, position 
5 26, position 27, position 30, position 31, position 33, 
position 34, position 35, position 36, and position 37, 
differ from the amino acid at the corresponding position of 
native GLP-1 { 7-37 ) OH . 

Other preferred GLP-1 compounds of formula II include: 

10 Val 8 -GLP-l{7-37}OH, GlyS-GLP-l {7-37 } OH, Glu 22 -GLP-1 ( 7-37 } OH, 
Asp 22 -GLP-1 (7-37) OH, Arg 22 -GLP-1 (7-37 } OH, Lys 22 -GLP~1 {7- 
37) OH, Cys 22 -GLP-1 {7-37JOH, Val 8 -Gin 2 2 -GLP- 1 { 7 -3 7 } OH , Val 8 - 
Asp 22 -GLP-1 (7-37)QH, Val 8 -Arg 22 -GLP-1 (7-37) OH, Val 8 -Lys 22 - 
GLP-1 {7-37 ) OH . ValS-Cys 22 -GLP-1 (7-37}OH, Gly 8 -Glu 22 -GLP-1 [7- 

15 37} OH, Gly 8 -Asp 22 -GLP~l{7-37)0H f Gly 8 -Arg 2 2 -GLP-1 (7-37) OH, 
G ly 8 - Lys 2 2 -GLP-1 { 7 - 3 7 ) OH , Gly 8 -Cy s 2 2 -GLP-1 {7-37} OH, Glu 2 2 ~ 
GLP-1 (7-36 } OH, Asp 2 2 -GLP - 1 (7 -36) OH , Arg 2 2 - GL P - 1 ( 7 - 3 6 ) OH , 
Lys 22 -GLP-1 (7-36} OH, Cys 22 -GLP-1 {7-365 OH, Val 8 -Glu 22 -GLP- 
1 (7-36JOH, Val 8 -Asp 22 -GLP-1 (7-36)OH, Val 8 -Arg 2 2 -GLP-1 (7- 

2G 36! OH, Val 8 -Lys 22 »GLP-i (7~36)OH, Val 8 -Cya 22 -GLP-1 (7-36) OH, 
Gly 8 -GIu 22 ~GLP~l {7-36 ) OH, Gly 8 -Asp 2 2 -GLP- 1 { 7 - 3 6 ) OH , Gly 8 - 
Arg 2 2 -GLP- 1(7-36) OH , Gly 9 -Lys 2 2 -GLP- 1 ( 7 - 3 6 ) OH , Gly 8 -Cys 22 - 
GLP-1 (7-36) OH, Lys 23 -GLP-1 (7-37JOH, Val 8 -Lys 23 -GLP-1 (7- 
37) OH, Gly 8 -Lys 23 -GLP-!{7-37)OH, His 24 -GLP-1 (7-37) OH, Val.3- 

25 His 24 -GLP-l(7~37)OH, Gly 8 -Kis 2 --GLP-1 (7-37 } OH, Lys 24 -GLP- 
l(7-37)OH, Val 8 ~Lys 24 -GLP-l(7-37)OH, Gly 8 -Lys 23 -GLP-1 (7- 
37) OH, G1u30_slp_i {7-37)OH, Val 8 -Glu 30 -GLP-1 (7-37) OH, Gly 8 - 
Glu30-GLP-1(7-37)OH, Asp30- G LP-1{?~37)OH, Val 8 ~A S p 3 0-GLP~ 
1(7-37) OH, Gly 8 -Asp 3 0-GLP-1 (7-37) OH, Gln 30 -GLP~1 (7-37 ) OH, 

30 Val 8 -Gln 30 -GLP-1(7-37)OH, Gly 8 -Gin 3 8 -GLP- 1 (7-37) OH, Tyr 30 ~ 
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GLP-1 (7-37 } OH, Val 8 -Tyr 3 0 -GLP- 1 (7-37) OH, Gly 8 - Tyr 3 O-QLP-1 { 7 - 
37} OH, Ser30_GL?-l(7-37)OH f Val 8 -Ser 3 ^-GLF-1 ( 7-37 ) OH, Gly 8 - 
Ser 3 0 • GLP I { 7 - 3 7 ) OH , His 3 °-GLP-l { 7 - 3 7 ) OH , Val 8 -His 30 -GLP~ 
1 {7-37) OH , GIy s -Hi s 3 0-GLP- 1 (7-37 } OH, Glu 3 4 -GLP - 1 { 7 - 3 7 ) OH , 
5 Val 8 -GIu 34 ~GLP-l<7-37)OH, Giy 8 ~Glu 34 -GLP - 1 (7- 37 } OH, Ala 34 - 
GLP-l (7 -37} OH, Val 8 ~Ala 34 -GLP~ 1 (7-37} OH, Gly 8 - Ala 3 4 -GLP- 1(7- 
37} OH, Gly 34 ~GLP-l{7-37)OH, Val 8 -Gly 34 ~GLP-l ( 7-37) OH, Gly 8 - 
Gly 34 -GLP-1 (7-37JOH, Ala 3 5 -GLP-1 ( 7 -- 37 ) OH, Val 8 ~Ala 35 ~GLP- 
1(7-37) OH, Gly s ~Ala 3 5-GLP~l(7-37)OH, Lys 35 ~GLP-l (7-37 )OH, 

10 Val 8 -Lys 38 -GLP~l(7-37)OH, Gly 8 -Lys 35 --GLP-1 (7-3?) OH, His 35 - 
GLP-1 ( 7 - 3 7 ) OH Val 8 -His 35_ GLP-1 (7-37) OH , Gly 8 -Hi s 3 5 -GLP-1 ( 7 - 
37)OH, Pro 3 5-GLP-1{7-37}OH, Val 8 ~Pro 3 5~ G LF-l (7-37) OH, Gly 8 - 
Pro 35 -GLP~l (7-37) OH/ Glu 35 -GLP-1 (7-37 ) OH Val 8 ~Glu 3 5~GLP~l (7- 
37 } OH, Gly 8 -Glu 35 -GLP-1 (7-37 ) OH, Val 8 -Ala 27 -GLP~1 (7-37 } OH, 

15 Val 8 -His 3 7 -GLP- 1. (7-37) OK, Val 8 --Glu 22 -Lys 23 -GLP~1 (7-37) OH, 
Val 8 -Glu 2 2 -Glu 2 3 -GLP- 1(7-37) OH , Val 8 -Glu 2 2 -Ala 27 -GLP- 1(7- 
37 } OH, Val 8 -Gly 3 4 - Ly s 3 5-GI.P- 1 (7-37 ) OH, Val 8 -His 37 -GLP-1 (7~ 
37) OH, Gly 8 -His 3 7 -GLP-1 (7-37)OH, Val 8 -Glu 22 -Ala 27 -GLP-i (7- 
3-7} OH, Gly 8 -Glu 22 ~Ala 27 -GLP-l (7-37) OH, Val 8 ~Lys 22 ~Glu 23 ~GLP- 

20 1 (7-37 ) OK, and Giy 8 ~Lys 22 -Glu 23 -GLP-l ( 7 -37) OH . 

Another preferred group of GLP-1 analogs and. 
derivatives for use in the present invention is composed of 
molecules of formula VI (SSQ ID NO: 7) 

2 5 Ri - X -Glu - Gly 1 0 -Thr- Phe-Thr- Ser -Asp 1 5 - Va 1 - S er - 

Ser-Tyr-Leu 20 -Y -Gly-Gln- -Ala-Ala 2 5 -Lys- % - 
Phe- J le-Ala 3 °- Trp- Leu-Val-Lys-Gly 3 5 - Arg-Ra 
formula VI { SEQ ID HO:?} 
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wherein: Rx is selected from the group consisting of L- 
Mstidine, D-histidine, desamino-histidine , 2 ~ amino- 
histidine, p-hydroxy-histidine , homohistidine, alpha - 
f luorome thy 1-- hi s fcidine , and alpha-methyl -hi stidine ; X is 
5 selected from the group consisting of Ala, Gly, Val, Thr, 
lie, and alpha -methyl-Ala ; Y is selected from the group 
consisting of Glu, Gin, Ala, Thr, Ser, and Gly; Z is 
selected from the group consisting of Glu, Gin, Ala, Thr, 
Ser, and Gly; and R2 is Gly-OH. 

10 Another preferred group of GLP-1 compounds for use 

in the present invention is disclosed in WO 91/11457, and 
consists essentially of GLP-l(7-34), GLP-1 {7-35 ) , GLP-1 (7- 
36), or GLP-1 ( 7-37 ) , or the amide form thereof, and 
pharmaceutical!^- -acceptable salts thereof, having at least 

15 one modification selected from the group consisting of: 
(a) substitution of glycine, serine, cysteine, 
threonine; asparagine, giutamine, tyrosine, alanine, valine, 
isoleucine, leucine, methionine, phenylalanine, arginine, or 
D-lysine for lysine at position 26 and/or position 34; or 

20 substitution of glycine, serine, cysteine, threonine, 
asparagine, giutamine, tyrosine, alanine, valine, 
isoleucine, leucine, methionine, phenylalanine, lysine, or a 
D-arginine for arginine at position 36; 

Co) substitution of an oxidation-resistant amino 

25 acid for tryptophan at position 31; 

(c) substitution of at least one of: tyrosine for 
valine at position 16; lysine for serine at position 18; 
aspartic acid for glutamic acid at position 21; serine for 
glycine at position 22; arginine for giutamine at position 

30 23; arginine for alanine at position 24; and giutamine for 
lysine at position 26; and 

\d) substitution of at least one of: glycine, 
serine, or cysteine for alanine at position 8; aspartic 
acid, glycine, serine, cysteine, threonine, asparagine, 

35 giutamine, tyrosine, alanine, valine, isoleucine, leucine, 
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methionine; or phenylalanine for glutamic acid at position 
9; serine, cysteine, threonine, asparagine, glutamine, 
tyrosine, alanine, valine, isoleucine, leucine, methionine, 
or phenylalanine for glycine at position 10; and glutamic 
5 acid for aspartic acid at position 15; and 

(e) substitution of glycine, serine, cysteine, 
threonine, asparagine, glut amine, tyrosine, alanine, valine, 
isoleucine}, leucine, methionine, or phenylalanine, or the D~ 
or N-aeylated or alkylated form of histidine for histidine 

10 at position 7,- wherein, in the substitutions is (a), (b) , 

id) , and (a) , the substituted amino acids can optionally be 
in the D-form and the amino acids substituted at position ? 
can optionally foe in the N-acylated ox- N-alkylated form. 

Because the ensyme, dipeptidyl -peptidase XV {DPP IV) , 

15 may be responsible for the observed rapid in vivo 

inactivation of administered GLP-1, [see, e.g., Mentlein, 
R. , at al. , Eur, j. aiochem. , 214:829-835 (19935 1, GLP-1 
analogs and derivatives that are protected from the activity 
of DPP IV in the context of a fusion protein are preferred, 

20 and fusion proteins wherein .the GLP-1 compound is Gly 8 -GLP- 
1 { 7 - 3 7 } OH , Val 8 - GLP- 1 (7 - 37} OH, a-methyl- Ala 8 -GLP-1 (7-37) OH, 
or Gly 8 -Gin 2 1 -GLP-1 (7-37 ) OH are more preferred. 

Another preferred group of GLP-1 compounds for use in 
the present invention consists of the compounds of formula 

25 VII (SEQ ID NO: 8) claimed in U.S. Patent No. 5,512,549, 
which is expressly incorporated herein by reference. 

Ri-Ala-Glu-Gly 10 - 
Thr - Phe-Thr - Ser -Asp 1 5 -Val- Ser- S er-Tyr-Leu 2 8 - 
3 0 GIu~Gly-Gln-Ala-Ala 2 5-Xaa-Glu~ Phe - I le -Al a 3 °- 

Irp-Leu-Val-Lys-Gly 35 -Arg-R 3 
1 

formula VII (SBQ ID MO: 8) 

35 
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wherein R 1 is selected from the group consisting of 4- 
imidasopropionyl, 4-imidaaoacetyl, or 4- imidazo-- 
.a, a dimethyl-acetyl t R 2 is selected from the group 
consisting of C5-C19 unbranohed. acyl, or is absent; j s 
5 selected from the group consisting of Gly-OH or KH2; and, 

Xaa is I-ys or Arg. 

More preferred compounds of formula IV for use in the 

present invention are those in which Xaa is Arg and R 2 is 

Cg-Cio unbranohed acyl . Even more preferred compounds of 

10 formula IV fox' use in the present invention are those in 
which Xaa is Arg, R 2 is Cg-Cio unbranohed acyl, and R 3 is 
Gly-OH. Other highly- preferred compounds of formula IV for 
use in the present invention are those in which Xaa is Arg, 
R 2 is Cs-Cio unbranohed acyl, r3 ± s Gly-OH, end R 1 is 4- 

15 iinidazopropionyl . An especially preferred compound of 

formula IV for use in the present invention is that in which 
Xaa is Arg, R 2 is C 8 unbranched acyl, R 3 is Gly-OH, and R 1 
is 4 - imi da z opr opi ony 1 . 

Preferably, the GLP-1 compounds comprise GLP-1 analogs 

20 wherein the backbone for such analogs or fragments contains 
an amino acid other than alanine at position 8 (position 8 
analogs). The backbone may also inclo.de L -hist! dine, !>-■ 
histidine, or modified forms of histidine such as desamino- 
histidine, 2 -amino-hi s t idine , (3-hydroxy-histidine, 

25 homohi st idine, a- fluoromethyl -hist idine, or a- methyl - 
histidine at position 7, It is preferable that these 
position 8 analogs contain one or more additional changes at 
positions 12, 16, 18, 19, 20, 22, 25, 27, 30, 33, and 37 
compared to the corresponding amino acid of native GLP-1 (7- 

30 37) OH. It is more preferable that these position 8 analogs 
contain one or more additional changes at positions 16, 18, 
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22, 25 and 33 compared to the corresponding amino acid of 
na t i ve GLP- 1 {7-37} OH. 

In a preferred embodiment, the GLP~1 analog is GLP-1{7- 
37) OH wherein the amino acid at position 12 is selected from 
5 the group consisting of tryptophan or tyrosine. It is more 
preferred that in addition to the substitution at position 
12, the amino acid at position 8 is substituted with 
glycine, valine, leucine, isoleucine, serine, threonine, or 
methionine and more preferably valine or glycine. It is 

10 even more preferred that in addition to the substitutions at 
position 12 and 8, the amino acid at position 22 is 
substituted with glutamic acid. 

In another preferred embodiment, the GLP-l analog is 
GLP--I{7-37)OH wherein the amino acid at position 16 is 

15 selected from the group consisting of tryptophan, 

isoleucine, leucine, phenylalanine, or tyrosine. It is more 
preferred that in addition to the substitution at position 
16, the amino acid at position 8 is substituted with 
glycine, valine, leucine, isoleucine, serine, threonine, or 

20 methionine and more preferably valine or glycine. It is 

even more preferred that in addition to the substitutions at 
position 16 and 8, the amino acid at position 22 is 
substituted with glutamic acid. It is also preferred that 
in addition to the substitutions at positions 16 and 8, the 

25 amino acid at position 30 is substituted with glutamic acid. 
It is also preferred that in addition to the substitutions 
at positions 16 and 8, the amino acid at position 37 is 
substituted with histidine. 

In another preferred embodiment, the GLP-i analog is 

30 G£.P-l(7-37)OH wherein the amino acid at position 18 is 

selected from the group consisting of tryptophan, tyrosine, 
phenylalanine, lysine, leucine, or isoleucine, preferably 
tryptophan, tyrosine, and isoleucine. It is more preferred 
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that in addition to the substitution at position 18, the 
amino acid at position 8 is substituted with glycine., 
valine, leucine, i so leucine, serine, threonine, or 
methionine and more preferably valine or glycine. It is 
5 eveu more; preferred that in addition to the substitutions at 
position 18 and 8, the amino acid at position 22 is 
substituted with glutamic acid. It is also preferred that 
in addition to the substitutions at positions 18 and 8, the 
amino acid at position 30 is substituted with glutamic acid. 

10 It is also preferred that in addition to the substitutions 
at positions 1.8 and 8, the amino acid at position 37 is 
substituted with histidine 

In another preferred embodiment, the GLP-1 analog is 
GLP-1 (7-37) OH wherein the amino acid at position 19 is 

15 selected from the group consisting of tryptophan or 

phenylalanine, preferably tryptophan. It is more preferred 
that, in addition to the substitution at position 19, the 
amino acid at position 8 is substituted with glycine, 
valine, leucine, isoleucine, serine, threonine, or 

20 methionine and more preferably valine or glycine- It is 

even more preferred that in addition to the substitutions at 
position 19 and 8, the amino acid at position 22, is 
substituted with glutamic acid. It is also preferred that 
in addition to the substitutions at positions 19 and 8, the 

25 amino acid at position 30 is substituted with glutamic acid. 
It is also preferred that in addition to the substitutions 
at positions 19 and 8, the amino acid at position 37 is 
substituted with histidine 

In another preferred embodiment, the GLP-1 analog is 

30 GLP-1 (7-37) OH wherein the amino acid at position 20 is 
phenylalanine, tyrosine, or tryptophan. It is more 
preferred that in addition to the substitution at position 
20, the amino acid at position 8 is substituted with 
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glycine, valine, leucine, isoleucine, serine, threonine , or 
methionine and more preferably valine or glycine . It is 
even more preferred that in addition to the substitutions at 
position 20 and 8, the amino acid at position 22 is 
5 substituted with glutamic acid. It is also preferred that 
in addition to the substitutions at positions 20 and 8, the 
amino acid at position 30 is substituted with glutamic acid. 
It is also preferred that in addition to the substitutions 
at positions 20 and 8, the amino acid at. position 37 is 

10 substituted with histidine 

in another preferred embodiment, the GLP-1 analog is 
GLP- 1(7-37} OH wherein the amino acid at position 25 is 
selected from the group consisting of valine, isoleucine, 
and leucine, preferably valine. It is more preferred that 

15 in addition to the substitution at position 25, the amino 
acid at position 8 is substituted with glycine, valine, 
leucine, isoleucine, serine, threonine, or methionine and 
more preferably valine or glycine. It is even more 
preferred that in addition to the substitutions at position 

2 0 25 and 8, the amino acid at position 22 is substituted with 

glutamic acid. It is also preferred that in addition to the 
substitutions at positions 25 and 8, the amino acid at 
position 30 is substituted with glutamic acid. It is also 
preferred that in addition to the substitutions at positions 
25 25 and 8, the amino acid at position 37 is substituted with 
histidine. 

In another preferred embodiment, the GLP-1 analog is 
GLP-I (7-37 ) OH wherein the amino acid at position 27 is 
selected from the group consisting of isoleucine or alanine. 

3 0 It is more preferred that in addition to the substitution at 

position 27, the amino acid at position 8 is substituted 
with glycine, valine, leucine, isoleucine, serine, 
threonine, or methionine and more preferably valine or 
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glycine. It is even more preferred that in addition to the 
substitutions at position 27 and 8, the amino acid at 
position 22 is substituted with glutamic acid. It is also 
preferred that in addition to the substitutions at positions 
5 27 and 8, the amino acid at position 30 is substituted with 
glutamic acid. it is also preferred that in addition to the 
substitutions at positions 27 and 8, the amino acid at 
position 37 is substituted with histidine 

~ another preferred <-ml t ^ > the GLP-1 analog is 

10 GLP-1 (7 -37) OH wherein the amino acid at position 33 is 

isoleucine. It is more preferred that in addition to the 
substitution at position 33, the amino acid at position 8 is 
substituted with glycine, valine, leucine, isoleucine, 
serine, threonine, or methionine and more preferably valine 

15 or glycine. It is even more preferred that in addition to 
the substitutions at position 33 and 8, the amino acid at 
position 22 is substituted with glutamic acid. It is also 
preferred that in addition to the substitutions at positions 
33 and S, the amino acid at position 30 is substituted with 

20 glutamic acid, it is also preferred that in addition to the 
substitutions at positions 33 and 8, the amino acid at 
position 37 is substituted with histidine 

The GLP-1 compounds have modifications at one or more 
of the following positions; 8, 12, 16, 18, 19, 20, 22, 25, 

25 27, 30, 33, and 37. These GLP-1 compounds show increased 
potency compared with GLP-1 {7-37 ) OH and comprise the amino 
acid sequence of formula IX {SEQ ID NO; 12} 

Xaa? - Xaa 8 -Glu-G ly~ Thr - Xaa i2 - Thr - S e r ■■ As p - Xa a 16 -S er - 
Xaax8~Xaaig«Xaa20~Glu-Xaa2 3 -Gin~Ala-Xaa25~Lys~Xaa37-- 

3 0 Phe-Ile-Xaa 3 »- Trp-Leu-Xaa 33 ™Lys-Giy~Arg-Xaa3 7 

Formula IX (SEQ ID HO: 12} 
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wherein; 

Xaa 7 is: L-histidine, D-histidine, desamino-liis tiding, 2- 
amino -hist idine , (J-hydroxy-hi s t i dine , 
5 homohls t idine , a- fluororaa thy l-histidine, or a- 

roethyX-his t idine ; 
Xaa 8 is; Ala, Gly, Val, Leu, lie, Ser, or Thr; 
Xaa i2 is; Phe, Trp, or Tyr; 
Xaa is is: Val, Trp, lie, Leu, Phe, or Tyr; 
10 Xaa 18 is; Ser, Trp, Tyr, Phe, Lys, lie. Leu, Val; 
Xaa i5 is: Tyr, Trp, or Phe; 
Xaaao is: Leu, Phe, Tyr, or Trp; 
Xaa 32 is: Gly, Glu, Asp, or Lys; 
Xaa 2 s is; Ala, Val, lie, or Leu; 
15 Xaa 27 is: Glu, lie, or Ala; 

Xaa S o is: Ala or Glu 
Xaa.33 is: Val, or He; and 
Xa&37 is; Gly, His, NH 2 , or is absent, 

20 Some preferred GLP-l corapomids of formula IX include 

GLP-l (7 --37) OH, GLP-l (7-3 6 ) -NHj, Gly s -GL?-I (7-37) OH, Gly 8 -GL3?~ 
1(7-36}NH 2 , Val 8 -GLP-l(7-37)OH, Val 8 -GLP-1 (7-36 >NH 2 , Leu 8 - 
GLP-1 (7-37}OH, Leu 8 -GLP-l (7-36 >NH 2 , Ile 8 -GLP™1 (7-37)0H, Xie s - 
GLP-l(7-36>NHa, Ser 8 ~GLP-l {7-37 i OH, Ser 8 -GLP~1 (7-36}MX 3 ., 

25 Thr s -GLP»l(7-37}OH, Thr s ~GLP-l (7-36 )ME 2 , Val s -Tyr x2 -GLP-1 (7- 
37) OH, Val s -Tyr 12 -GLP-1 {7-36)NH«, Val s -«Tyr ls -GLP-l (7-37 ) OH, 
Val s -Tyr 16 -GLP-1{7--36}5SH 2 , Val 3 ~Glu 22 -GLP- 1 (7-37 ) OH, Val 8 - 
Glu 22 -GLP-l ( 7-3 6)NH 2 , Gly 8 ~Glu 32 ~GLP- 1(7-37) OH , Gly 8 -Glu s2 -GLP- 
1{7-36)NH 2 , Val 8 -Asp 22 -Gl»P-l {7-37 } OH, Val 8 - Asp 22 -GLP-l (7- 

30 36}NHj, Gly 8 -Asp 3a ~GLP-l(7-37)GH, Gly s -Asp 22 -GLP-l { 7-36) NH 2 , 
Va 1 8 - Ly s 2 2 - GL P - 1 { 7-37 ) OH, Val*-Lys 22 -GLP- 1(7-36) NH 2 , Gly S ~ 
Lys 22 -GLP- 1 { 7- 3 7 ) OH , Gly s -Lys 23 ~GLP- 1(7-36) NH 2 , Leu s -Gl u 2 2 - GL P - 
1 {7-373 OH, Leu 8 -Glu 2a -GLP-1 (7~36)HH 2 , Ile s ~Glu 22 -GLP~l (7- 
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37) OH, Xle 8 -Glu 22 -GI»P-l {7-36}SJH 2 , Leu 8 -Asp 22 -GLP-1 (7-37 ) OH,, 
Leu 8 -Asp 22 -GLP-l (7-36)KH 2 , Ile 8 -Asp 22 -GLP- 1 { 7 -37 } OH, Ue s - 
Asp aa -GLP-1 (?-36)SiH 2 , Leu 3 -Lys 22 -GLP~1 (7~37)OH, Leu 8 -Lys 22 -GLP- 
1 (7-36 )KSH 2 , Ile a -Lys 22 -GLP-1 <7-37}OH, Ile 8 -Lys 22 -GLP-l { 7- 
5 36)NH 2 , Ser 8 -Glu 22 -GLP-1 47-37) OH, Ser 8 -Glu 22 -GLP-1 (7-36)MH 2 , 
Thr 8 -Glu 22 ~GLP-l (7-37) OH, 1 s hr 8 ~Glu 22 -GX,P~l (7-36)NH 2 » Ser 8 - 
Asp 22 -GLP-l (7-37)OH, Ser 8 -Asp 22 -GLP-1 (7-36)MI 2 , Thr s »Asp 2a -GLP- 
1 (7---37SQH, Thr 8 ~Asp 22 ~GLP- 1 ( 7 - 3 6 ) NH 2 , Ser e ~Lys 22 -GX,P-l (7- 
37 ) OH, Ser 8 -Lys 22 -GLP-1 (7-36)NH 2 , Thr 8 -Lys 22 -CLP-1 { 7 -37 ) OH, 

10 Thr 8 -Lys 22 -GLP-1 (7-36 )KH 2 , Glu 23 -GLP-1 ( 7-37 } OH, Glu 22 -GLP~ 1 ( 7- 
36)NH 2 , Asp 22 -GLP-l(7-37)OH, Asp 22 -GLP-1 ! 7-36} HH 2 , Lys 22 -GLP~ 
1 (7-37) OH, Lys 22 -GLP-1 (7-36)J?H 2 , Val 8 -Ala 27 -GLP-1 (7-37) OH, 
Val 8 -Glu 22 -Ala 27 -GLP-1 (7-37)GH, Val 8 -Glu 50 - GLP-l (7-37)OH, Val 8 - 
Glu 30 -GLP~1(7-36)NH 2 , Gly 8 -Glu 3c ~GLP»l (7~37)OH, Gly 8 -Qlu 30 -GLP- . 

15 1(7-36)MH 2 , L©u 8 -Glu 3i) -GLP-1 (7-37 ) OH, Leu 8 -Glu 30 ~GLP~l (7- 

36) HH 2 , Ile 8 -Glu 30 -GLP-1 {7-37)OH, Ile s ---Glu 30 -GLP-1 {7-36jMHa, 
Ser 8 -Glu 33 -GLP-l (7-37) OH, Ser 8 ~Glu 30 ~GLP-l (7-36)NH 2 , .Thr 8 - 
Glu 30 -GLP~l(7-37 ) OH, Thr 8 ~Glu 3a -GLP~l {7-36}Mi 2 , Val 8 -His 37 -GLP~ 
1 (7-37) OH, Val 8 -His 37 -GLP-1 <7-36)MH 2 , Gly s ~His 37 ~GLP-l (7-37 )OH, 

20 Gly 8 -Hi8 3 '' -GLP-l (7 -36)Mi 2 , Leu 8 ~His 37 -GLP-l (7-37) OH, Leu 8 -His 37 ~ 
GLP-1 (7-36)NH 2( Ile s ~His 37 ~GLP~l (7-37JOH, Ile 8 -His 3? -GLP-1 (7- 
3 6 } NH 2 , Ser 8 -Ki s 37 -GLP- 1 (7-37) OH , Ser 8 - Hi s 37 -GLP- 1 ( 7 -3 6 ) NK 2 , 
Thr 8 -His 37 -GLP~1 (7-37) OH, Tbr 8 -His 37 -GLP-1 (7-36 )NH». 

Some preferred GLP-l compounds of formula IX having 

25 multiple substitutions include GLP-l (7-37) OH wherein 

position 8 is valine or glycine, position 22 is glutamic 
acid, position 16 is tyrosine, leucine or tryptophan, 
position 18 is tyrosine, tryptophan, or isoleucine, position 
25 is valine and position 33 is isoleucine. Other preferred 

30 GLP-l compounds include the following: Val 8 -Tyr ll5 -GLP-1 > (7- 

37) OK, Val*~Tyr"-Glu 22 -Gl.P-l (7-37} OH, Val s -Tyr 16 -Phe ls -GLP- 
1 (7-37 ) OH, Va! s ~Tyr ls --Glu 22 -GLP - l (7-37) OH, Val 8 -Trp"-Glu" - 
GLP-l (7-37) OH, Val s -Leu 26 -Glu 2a -GLP-1 (7-37) OH, Val 8 -Il.e ls - 
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Glu 22 -GLP- 1 ( 7- 37 } OH, Val 8 -Phe i6 -Glu 22 -GLP-1 (7-37) OH; Val H ~ 
Trp 18 ~Glu 22 -GLP~l {7-37JOH, Val 8 -Tyr 13 ~Glu 22 -GLP-l {7-37}OH, 
Val 8 -Phe x8 -Glu 22 ~GLP~l < 7 -37) OH, and Val 8 -Ile i8 -Glu 22 -GLP-1 (7- 
37) OH. 

5 The GLP-1 compounds of the present invention also 

encompass Exendin compounds. Exendin- 3 and Exendin --4 are 
biologically active peptides first isolated from 
Helcdermatidae lizard venoms and have been shown to bind the 
GLP-1 receptor and stimulate CAMP- dependent H + production in 

10 mammalian parietal ceils. Exendin- 3 and Exendin- 4 are both 
39 amino acid peptides which are approximately 53% 
homologous to GLP-1 . They act as potent agonists of GLP-1 
activity. Notably, an terminally truncated derivative of 
Exendin, known as Exendin (9-39 amino acids}, is an inhibitor 

15 of Exendin- 3, Exendin- 4 and GLP-1. 

An Exendin compound typically comprises a polypeptide 
having the amino acid sequence of Exendin- 3, Exendin- 4, or 
an analog or fragment thereof. Exendin-3 and Exendin- 4 are 
disclosed in U.S. Patent No. 5,424,286. 

20 

Exendin -3 has the amino acid sequence of SEQ IB HO: 9: 
7 8 9 10 11 12 13 14 15 16 17 
Hi s - Ser-Asp-Giy-Thr-Phe-Thr - Ser -Asp-Leu-Ser- 
IS 19 20 21 22 23 24 25 26 27 28 

2 5 Lys -G In-Met -Glu-Glu-Glu-Al a -Val -Ar g -Leu - Phe ~ 

29 30 31 32 33 34 35 36 37 38 39 
Ile-Glu-Trp-Leu-Lys-Asn-Gly-Gly-Pro-Ser-Ser- 
40 41 42 43 44 45 
Sly- Ala -Pro -Pro -Pro -Ser 

30 f SEQ ID SO: 9) 
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Exendin- 4 has the amino acid sequence of SSQ ID NO; 10: 
7 8 9 10 11 12 13 14 15 16 17 
His-01y-01u-Gly~Tlir-Phe-Thr-S6r-Asp-Leu-Ser~ 
5 18 19 20 21 22 23 24 25 26 27 28 

I.,y s -Gin -Met -Glu .-Glu-Gl u- Ala-Va 1 - Ar g -Leu - Phe - 
29 30 31 32 33 34 35 36 37 38 39 
Ile-Glu-Trp-iieu-Lys-Asn-Gly-Gly-Pro-Ser-Ser- 
40 41 42 43 44 45 
10 Gly- Ala - Pro - Pro - Pro- Ser 

<SEQ ID NO: 10} 



GLP-1 compounds also include Exendin fragments which 
are polypeptides obtained after truncation of one or more 

15 amino acids from the N~ terminus and/or C-terminus of Exendin 
or an Exendin analog. Furthermore, GLP-1 compounds include 
Exendin polypeptides in which one or more amino acids have 
been added to the N~ terminus and/or C- terminus of Exendin or 
fragments thereof . Exendin compounds of this type have up to 

20 about forty- five amino acids. 

GLP-1 compounds also include "Exendin analogs . * An 
Exendin analog has sufficient homology to Exendin- 4, 
Exendin-3, or a fragment thereof such that the analog has 
insulinotropie activity. The activity of Exendin fragments 

25 and/or analogs can be assessed using in vitro assays such as 
those described in EP 619,322 and U.S. Patent Ho. 5,120,712. 

Preferably, an Exendin analog has the amino acid 
sequence of Exendin-4 or a fragment thereof, modified so 
that from one, two, three, four or five amino acids differ 

30 from the amino acid in cor responding position of Exendin- 4 
or the fragment of Exendin~4. In the nonmenclature used 
herein to designate Exendin compounds, the substituting 
amino acid and its position, is indicated prior to the parent 



WO 02/46227 



PCT/li SO J/43165 



- 33 - 

structure. For example, Val 8 -Bxendin~ 4 designates an 
Exendin compound in which the glycine normally found at 
position 8 of Sxendin-4 has been replaced with valine. 

Another preferred group of GI>P- 1 compounds is composed 
5 of GLP-1 /Exendin- 4 analogs of formula VIII (SSQ ID HO: 11} . 

? 8 9 10 11 12 13 14 15 16 17 
Xaa~Xaa-Glu-Giy~Thr- Phe-Thr- Ser-A sp-Xaa~ Ser - 
IS 19 20 21 22 23 24 25 26 27 28 
1 0 Xaa - Xaa-Xaa-Glu-Xaa-Xa a~A 1 a - Xaa~Xaa~Xaa~ Phe- 

29 30 31 32 33 34 35 36 3? 
I 1 e - Xa a - T r p - Leu - Xaa - Xaa -Gly-Xaa-R 
formula VIII (SSQ ID NO: 11} 

15 wherein: 

Xaa at position 7 is: L-hisfcidine, D-histidxne, desamino- 
histidine, 2-amino~histidine, ^-hydroxy-histidine, 
ho:nohi s t i dine , a-fluoromethyl-Mstidine or a-methyl- 
histidine; 



Xaa 


at 


position 


8 is: 


Gly, Ala, or Val; 


Xaa 


at 


position 


16 




Leu or Val; 


Xaa 


at 


position 


18 


is 


Lys or Ser; 


Xaa 


at 


position 


19 


is: 


Gln or Tyr; 


Xaa 


at 


position 


20 


.1 s : 


Met or Leu; 


Xaa 


at 


position 


22 


is: 


Glu or Gin; 


Xaa 


at 


position 


23 


is 


Glu, or Gln; 


Xaa 


at 


position 


25 


is 


Val or Ma; 


Xaa 


at 


position 


2S 


is 


Arg or Lys; 


Xaa 


at 


position 


27 


is 


Leu or Glu; 


Xaa 


at 


position 


30 


is 


Glu or Ala; 


Xaa 


at 


position 


33 


is 


Val or Lys; 


Xaa 


at 


position 


34 


is 


Asn or Lys; 


xaa 


at 


position 


36 


is 


Gly or Arg; and 
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R at position 37 is: Gly, Pro, Pro-Ser-Ser-Gly-Ala-Pro-Pro- 
Pro-Ser, or is absent. The activity of IS different species 
that fail within this genus is provided in Table 6. 

Further Sxendin-analogs that are useful for the present 
5 invention are described in PCT patent publications WO 

99/25728 (Beeley et al.}, WO 99/25727 Beeley et al.), WO 
98/05351 (Young et al.) , WO 99/40788 (Young et al.), WO 
99/07404 (Beeley et al) , and WO 99/43708 (Knudsen et al) . 

The GLP-1 fusion proteins of the present invention can 

10 comprise glycosylation Bites. Glycosylation is a chemical 
modification wherein sugar moieties are added to the protein 
at specific sites. Glycosylation of proteins play a role in 
ensuring the correct, charge, confirmation, and stability of 
maturing protein and can target the protein to the cell 

15 surface and eventual secretion of the protein. Host 

importantly, glycosylation effects the in vivo clearance 
"rate for many proteins. Sugars can be 0-1 inked or KF- linked. 
Generally, 0- linked sugars are added to the hydroxy! -group 
oxygen of serine and threonine, while N- linked sugars are 

20 added to the amide nitrogen of asparagine. The consensus 
site for N-glycosylation is &sn XI X2 wherein XI is any 
amino acid except Pro and X2 is Sen or Thr, 

GLP-1 compounds are generally not glycosylated in vivo; 
however, interestingly the GLP-1 fusion proteins of the 

25 present invention that comprise a GLP-1 compound with a C 
terminal extension fused to an Fc sequence is glycosylated 
at the last serine in the C terminal extension {SSGAPPPS*} 
and at threonine at position 11 in the N terminal .region of 
FC <AEPKSCDKTHT*CPPC . . 



30 
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Meter ol ogou sJFc_ fusion proteins: 

The GLP-1 compounds described above can. be fused 
directly or via a peptide linker to the Fc portion of an 
5 immunoglobulin . 

Immunoglobulins are molecules containing polypeptide 
chains held together by disulfide bonds, typically having 
two light chains and two heavy chains. In each chain, one 
domain (V) has a variable amino acid sequence depending on 

10 the antibody specificity of the molecule. The other domains 
CO have a rather constant sequence common to molecules of 
the same class. 

As used herein, the Fc portion of an immunoglobulin has 
the meaning commonly given to the term in the field of 

15 immunology. Specifically, this term refers to an antibody 
fragment which is obtained by removing the two antigen ' 
binding regions {the Fab fragments) from the antibody. One 
way to remove the Fab fragments is to digest the 
immunoglobulin with papain protease. Thus, the Fc portion 

20 is formed from approximately equal sised fragments of the 
constant region from both heavy chains, which associate 
through non-covalent interactions and disulfide bonds. The 
Fc portion can include the hinge regions and extend through 
the CH2 and CK3 domains to the C- terminus of the antibody. 

25 Representative hinge regions for human and mouse 

immunoglobulins can be found in Antibody Engineering, A 
Practical Guide, Borrebaeck, C.A.K., ed. , W.H. Freeman and 
Co., 19.92, the teachings of which are herein incorporated by 
reference. The Fc portion can further include one or more 

30 glycosylation sites. The amino acid sequence of a 

representative Fc protein containing a hinge region,. CH2 and 
CH3 domains, and one W-glycosylation site at position 82 is 
shown in Figure 1, 
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There are five types of human immunogiobul in Fc regions 
with different effect or and pharmacokinetic properties: 
IgG, IgA, IgM, IgD, and IgE. IgG is the most abundant 
immunoglobulin in serum. IgG also has the longest half-life 
5 in serum of any immunogiobul in {23 days) . Unlike other 

imnamoglobulins , IgG is efficiently recirculated following 
binding to an Fc receptor. There are four IgG subclasses 
Gl, G2, G3, and G4, each of which have different effect or 
functions. Gl, G2, and G3 can bind Clq and fix complement 

10 while G4 cannot. Even though G3 is able to bind Clq more 
efficiently than Gl, Gl is more effective at mediating 
complement-directed coll lysis. G2 fixes complement very 
inefficiently. The Clq binding site in IgG is located at 
the carboxy terminal region of the CH2 domain. 

15 All IgG subclasses are capable of binding to Fc 

receptors (CDlS, CD32, CD64) with Gl and G3 being more 
effective than G2 and 04. The Fc receptor binding region of 
IgG is formed by residues located in both the hinge and the 
carboxy terminal regions of the CH2 domain. 

20 IgA can exist both in a monomeric and dimeric form held 

together by a J~chain. IgA is the second most abundant Ig 
in serum, but it has a half -life of only 6 days. IgA has 
three effect: or functions. It binds to an IgA specific 
receptor on macrophages and eosinophils, which drives 

25 phagocytosis and degranulation, respectively. It can also 
fix complement via an unknown alternative pathway. 

IgM is expressed as either a pentamer or a hexamer, 
both of which are held together by a J-chain. IgM has a 
serum half -life of 5 days. It binds weakly to Clq via a 

30 binding site located in its CH3 domain. IgD has a half-life 
of 3 days in serum. It is unclear what effect or functions 
are attributable to this Ig. IgE is a monomeric Ig and has 
a serum half -life of 2.5 days. IgE binds to two Fc 
receptors which drives degranulation and results in the 

35 release of proinflammatory agents. 
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Depending on the desired in vivo effect, the 
heterologous fusion proteins of the present invention may 
contain any of the i so types described above or may contain 
mutated Pc regions wherein the complement and/ or Fc receptor 
5 binding functions have been altered. Thus, the heterologous 
fusion proteins of the present invention may contain the 
entire Fc portion of an immunoglobulin, fragments of the Fc 
portion of an immunoglobulin, or analogs thereof fused to a 
GLP-1 compound- 

10 The fusion proteins of the present invention can 

consist of single chain proteins or as multi-chain 
polypeptides . Two or more Fc fusion proteins can foe 
produced such that they interact through disulfide bonds 
that naturally form between Fc regions. These multimers can 

15 be homogeneous with respect to the GLP-1 compound or they 
may contain different GLP-1 compounds fused at the H- 
fcerminus of the Fc portion of the fusion protein , 

Regardless of the final structure of the fusion 
protein, the Fc or Fc-like region must serve to prolong the 

20 in vivo plasma half-life of the GLP-1 compound fused at the 
Ist-terminus , Furthermore, the fused , GLP--1 compound must 
retain some biological activity. An increase in half -life 
can be demonstrated using the method described in Example 7 
wherein the half -life of the fusion protein is compared to 

25 the half -life of the GLP-1 compound alone. Biological 

activity can be determined by in vitro and in vivo methods 
know in the art. Representative biological assays are 
described in Examples 6, 8, and 9. 

Since the Fc region of IgG produced by proteolysis has 

30 the same in vivo half -life as the intact IgG molecule and 
Fab fragments are rapidly degraded, it is believed that the 
relevant sequence for prolonging half-life reside in the CII2 
and/or CH3 domains. Further, it has been shown in the 
literature that the catabolic rates of IgG variants that do 

35 not bind the high-affinity Fc receptor or Clq are 
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indistinguishable from the rate of clearance of the parent 
wild- type antibody., indicating that the catabolic site is 
distinct from the sites involved in Fc receptor or Clq 
binding. [Wawrzynczak et al., {1992} Molecular Immunology 
5 29:221], Site-directed mutagenesis studies using a murine 
IgGl Fc region suggested that the site of the IgGl Fc region 
that controls the catabolic rate is located at the CH2-CH3 
domain interface. 

Based on these studies, Fc regions can be modified at 

10 the catabolic site to optimise the half -life of the fusion 
proteins. It is preferable that the Fc region used for the 
heterologous fusion proteins of the present invention be 
derived front an IgGl or an IgG4 Fc region. It is even more 
preferable that the Pc region be IgG4 or derived from XgG 4 , 

15 Preferably the IgG Fc region contains both the CH2 and CH3 
regions including the hinge region, 

ologous albu min fus ion prot&i ns?£ 

The GLP-1 compounds described above can be fused 

20 directly or via a peptide linker to albumin or an analog, 
fragment, or derivative thereof. 

Generally the albumin proteins making up part of the 
fusion proteins of the present invention can be derived from 
albumin cloned from any species . However, human albumin and 

25 fragments and analogs thereof are preferred to reduce the 
risk of the fusion protein being immunogenic in humans. 
Human serum albumin (HSA) consists of a single non- 
glycosylated polypeptide chain of 585 amino acids with a 
formula molecular weight of 66,500. The amino acid sequence 

30 of human HSA is shown in figure 2. [See Meloun, et al. 
(1975) FEES Letters 58:136; Behrens, et al . (1975} Fed. 
Proc. 34:591; Lawn, et al, (1981) Nucleic Acids Research 
9:6102-6114; Minghetti, et al. (1986) J. Biol . Chem. 
261:6747 3. A variety of polymorphic variants as well as 

35 analogs and fragments of albumin have been described. [See 
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Weitkamp, efc si., (1973) Ann. Hum. Genet. 37:219]. For 
example,- in EP 322,094, the inventors disclose various 
shorter forms of HSA. Some of these fragments include 
HSA< 1-373) , HSA (1-388) , HSA<l-389) , HSA {1-3 69) , and HSA(1~ 
5 419) and fragments between 1-369 and 1-419. SP 399,666 
discloses albumin fragments that include HSA (1-177) and 
HSA { 1-200) arid fragments between HSA(1-177) and HSA{l-200) . 

It is understood that the heterologous fusion proteins 
of the present invention include GLP-l compoxxnds that are 

10 coupled to any albumin protein including fragments, analogs, 
and derivatives wherein such fusion protein is biologically 
active and has a longer plasma half- life than the GLP-l 
compound alone. Thus, the albumin portion, of the fusion 
protein need not necessarily have, a plasma half -life equal 

15 to that of native human albumin. Fragments, analogs, and 
derivatives are .known or can be generated that have longer 
half-lives or have half-lives intermediate to that of native 
human albumin and the GLP-l compound of interest. 

The heterologous fusion proteins of the present. 

20 invention encompass proteins having conservative amino acid 
substitutions in the GLP-l compound and/or the Fc or albumin 
portion of the fusion protein. A ^conservative 
substitution" is the replacement of an amino acid with 
another amino acid that has tee same net electronic charge 

25 and approximately the same size and shape. Amino acids with 
aliphatic or substituted aliphatic amino acid side chains 
have approximately the same size when the total number 
carbon and heteroatorns in their side chains differs by no 
more than about four. They have approximately the same 

30 shape when the number of branches in their side chains 

differs by no more than one. Amino acids with phenyl or 
substituted phenyl groups in their side chains are 
considered to have about the same size and shape. Except as 
otherwise specifically provided herein, conservative 
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substitutions are preferably made with naturally occurring 
amino acids , 

However,, the term "amino acid" is used herein in its 
broadest sense, and includes naturally occurring amino acids 
5 as well as non-natural ly occurring amino acids, including 
amino acid analogs and derivatives. The latter includes 
molecules containing an amino acid moiety. One skilled in 
the art will recognize, in view of this broad definition, 
that reference herein to an amino acid includes, for 

10 example, naturally occurring proteogenic L-amino acids; D- 
araino acids; chemically modified amino acids such as amino 
acid analogs and derivatives; xxaturaily occurring non- 
protaogenic amino acids such as norleucine, ^-alanine, 
ornithine, 6 ABA, etc.; and chemically synthesized compounds 

15 having properties known in the art to be characteristic of 
amino acids. As used herein, the term "proteogenic" 
indicates that the amino acid can be incorporated into a 
peptide, polypeptide, or protein in a cell through a 
metabolic pathway. 

20 The incorporation of non-natural amino acids, including 

synthetic non-native amino acids, substituted amino acids, 
or one or raore D-airdno acids into the I > \ .» fusion 

proteins of the present invention can be advantageous in a 
number of different ways. D~artd.no acid-containing peptides, 

25 etc., exhibit increased stability in vitro or in vivo 

compared to L- amino acid-containing counterparts. Thus, the 
construction of peptides, etc., incorporating D- amino acids 
can be particularly useful when greater intracellular 
stability is desired or required. More specifically, D- 

30 peptides, etc., are resistant to endogenous peptidases and 
proteases, thereby providing improved .bioavailability of the 
molecule, and prolonged lifetimes in vivo when such 
properties are desirable. Additionally, D-peptides, etc., 
cannot be processed efficiently for major histocompatibility 

35 complex class U-restricted presentation to T helper cells. 
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and are therefore, less likely to induce humoral immune 
responses in the whole organism. 

In addition to structure/ function analyses of the 
various polypeptides encompassed by the present invention,. 
5 there are numerous factors that can foe considered when 

selecting amino acids for substitution. One factor that can 
foe considered in making such changes is the hydropathic 
index of amino acids. The importance of the hydropathic 
amino acid index in conferring interactive biological 

10 function on a protein has been discussed by Kyte and 
Doolittle (1982, J. Mol. Biol., 157: 105-132). It is 
accepted that the relative hydropathic character of amino 
acids contributes to the secondary structure of the 
resultant protein. This, in turn, affects the interaction 

15 of the protein with molecules such as enzymes, substrates, 
receptors, ligands, DMA, antibodies, antigens, etc. Based 
on its hydrophobicity and charge characteristics, each amino 
acid has been assigned a hydropathic index as follows; 
isoleucine (+4.5) ; valine (+4,2); leucine £ +3 . 8 ) ? 

20 phenylalanine (4-2.8),- cysteine/cystine {+2.5}; methionine 
(+1.9); alanine (4-1.8); glycine (-0.4),- threonine (-0.7); 
serine {-0.8} ; tryptophan (-0.9) ; tyro sine ( -1.3 ) ; proline 
(-1.6} histidine { -3 . 2 ) ; 

glu tama t e / g lut amine /aspartate/ aspar agine (-3 .5) ; lysine 

25 {-3.9) ; and arginine (-4.5). 

As is known in the art, certain amino acids in a 
peptide, polypeptide, or protein can be substituted for 
other amino acids having si similar hydropathic index or 
score and produce a resultant peptide, etc., having similar 

30 or even improved biological activity. In making such 
changes, it is preferable that amino acids having 
hydropathic indices within ±2 are substituted for one 
another. More preferred substitutions sire those wherein the 
amino acids have hydropathic indices within ±1. Most 
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preferred substitutions are those wherein the amino acids 
have hydropathic indices within ±0.5. 

Like amino acids can also be substituted on the basis 
of hydr ophi lici ty . U.S. Patent Ho. 4, 554,101 discloses that 
5 the greatest local average hydr ophi lici ty of a protein, as 
governed by the hydrophilicity of its adjacent amino acids, 
correlates with a biological property of the protein. The 
following hydropbi 1 ici t y values have been assigned to amino 
acids: arginine/ lysine {+-3.0); aspartate /glut amate (+3.0±1); 

10 serine {+0.3) ; a sparagine/ glut amine ( + 0.2); glycine {0} ; 
threonine (-0.4); proline {-0.5±1> ; alanine/histidine 
{-0.5}; cysteine (-1.0); methionine (-1.35; valine {-1.5}? 
leucine ./ isoleuc ine {-1,8) ; tyros ine (-2.3) ; phenyl alanine 
(-2-5); and tryptophan {-3.4). Thus, one amino acid in a 

15 peptide, polypeptide, or protein can foe substituted by 

another amino acid having a similar hydrophilicity score and 
still produce a resultant peptide, etc.,. having similar 
biological activity, i.e., still retaining correct 
biological function. In making such changes, amino acids 

20 having hydropathic indices within. ±2 are preferably 
substituted for one another, those within ±1 are more 
preferred, and those within ±0.5 are most preferred. 

As outlined above, amino acid substitutions in the 
fusion proteins of the present invention can be based on the 

25 relative similarity of the amino acid side- chain 
substituents, for example, their hydrophobic! ty , 
hydrophilicity, charge, size, etc. Furthermore, 
substitutions can be made based on secondary structure 
propensity. For example, a helical amino acid can foe 

30 replaced with an amino acid that would preserve the helical 
structure. Exemplary substitutions that take various of the 
foregoing characteristics into consideration in order to 
produce conservative amino acid changes resulting in silent 
changes within the present peptides, etc., can be selected 

35 from other members of the class to which the naturally 
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occurring amino acid belongs, Amino acids can be divided 
into the following four groups: (!) acidic amino acids; (2) 
basic amino acids; (3) neutral polar amino acids; and (4} 
neutral non-polar amino acids. 

5 

Gen eral me thods for making' the het erologous fu s ion pr oteins 
of the pre s ent inxrssici on . 

Although the heterologous fusion proteins of the 
present invention can be made by a variety of different 

10 methods, recombinant methods are preferred. For purposes of 
the present invention, as disclosed and claimed herein, the 
following general molecular biology terms and abbreviations 
are defined below. The terms and abbreviations used in this 
document have their normal meanings unless otherwise 

15 designated. For example, "°C refers to degrees Celsius; 
w mmoX" refers to millimole or miilimoles; "mg* refers to 
milligrams; *\ig" refers to micrograms; "mi or mU" refers to 
milliliters; and "jil or fXh" refers to microliters. Amino 
acids abbreviations are as. set forth in 3? C.F.R. § 1.822 

20 (b) {2} (1994) . 

"Base pair" or "bp" as used herein refers to DMA or 
RNA. The abbreviations A,C,G, and T correspond to the 5'~ 
monophosphate forms of the deoxyribonucleo sides 
( deoxy ) adenosine .. { deoxy ) cyt idine , ( deoxy ) guano sine, and 

25 thymidine, respectively, when they occur in DHA molecules. 
The abbreviations U, C,G, and A correspond r.o the 5'- 
monophosphate forms of the ribonucleotides uridine, 
cytidine, guanosine, and adenosine, respectively when they 
occur in RNA molecules. In double stranded DMA, base pair 

30 may refer to a partnership of A with T or C with G. In a 
DHA /RNA r heterodupiex base pair may refer to a partnership 
of A with U or C with 0. {See the definition of 
"complementary" , infra. 5 

"Digestion"" or "Restriction" of DHA refers to the 

35 catalytic cleavage of the DNA with a restriction enzyme that 
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acts only at certain sequences in the DHA ( "sequence- 
specific endonuc leases ") . The various restriction enzymes 
•used .herein are commercially available and their reaction 
conditions, cof actors, and other requirements wei-e used as 
5 would be known to one of ordinary skill in the art. 

Appropriate buffers and substrate amounts for particular 
restriction enzymes are specified by the manufacturer or can 
be readily found in the literature. 

"ligation" refers to the process of forming 

10 phospliodi ester bonds between two double stranded nucleic 

acid fragments. Unless otherwise provided, ligation may be 
accomplished using known buffers and conditions with a DNA 
ligase, such as T4 DHA ligase. 

"Plasmid" refers to an ejstrachromosomal (usually} self- 

15 replicating genetic element. Plasmids are generally 

designated by a lower case "p" followed by letters and/or 
numbers . The starting piasmids herein are either 
commercially available, publicly available on an 
unrestricted basis,, or can be constructed from available 

20 piasmids in accordance with published procedures. In 

addition, equivalent piasmids to those described are known 
in the art and will be apparent to the ordinarily skilled 
artisan. 

"Recombinant DMA cloning vector'' as used herein refers 
25 to any autonomously replicating agent, including, but not 
limited to, plasmids and phages, comprising a PIMA molecule 
to which one or more additional DPA segments can or have 
been added. 

"Recombinant DNA expression vector" as used herein 
30 refers to any recombinant DMA cloning vector in which a 

promoter to control transcription of the inserted DHA has 
been incorporated. 

^Transcription" refers to the process whereby 
information contained in a nucleotide sequence of DMA is 
35 transferred to a complementary RNA sequence. 
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"Trans faction" refers to the uptake of an expression 
vector by a host cell whether or not any coding sequences 
are, in fact, expressed. Numerous methods of trans feet ion 
are known to the ordinarily skilled artisan, for example, 
5 calcium phosphate co -precipitation, liposome fcransf ection, 
and exec troporat ion. Successful trans faction is generally 
recognized when any indication of the operation of this 
vector occurs within the host cell. 

* Transformation* refers to the introduction of DMA into 

10 an organism so that the DNA is replieable, either as an 
extrachromosomal element or by chromosomal integration. 
Methods of transforming bacterial and eukaryotic hosts are 
well known in the art, many of which methods, such as 
nuclear injection,, protoplast fusion or by calcium treatment 

15 using calcium chloride are summarized in J . Sambrook, et 
al. , Molecular Cloning: A Laboratory Manual, (1989). 
Generally, when introducing DMA into Yeast the term 
transformation is used as opposed to the term trans f ection. 
* Translation" as used herein refers to the process 

20 whereby the genetic information of messenger KKA (mRNA) is 
used to specify and direct the synthesis of a polypeptide 
chain. 

"Vector" refers to a nucleic acid compound used fox' the 
transfection and /or transformation of ceils in gene 

25 manipulation bearing polynucleotide sequences corresponding 
to appropriate protein molecules which, when combined with 
appropriate control sequences, confers specific properties 
on the host cell to be transfected and /or orans formed. 
Fiasxnids, viruses, and bacteriophage are suitable vectors. 

30 Artificial vectors are constructed by cutting and joining 
DNA molecules from different sources using restriction 
enzymes and 1 leases. The term "vector" as used herein 
includes Recombinant DN& cloning vectors and Recombinant DHA 
expression vectors. 
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«Conplementary" or "Complementarity", as used herein, 
refers to pairs of bases (purines and pyrimi dines) that 
associate through hydrogen bonding in a double stranded 
nucleic acid. The following base pairs are contpl eaien fcary : 
5 guanine and cytosine; adenine and thymine; and adenine and 
uracil. 

* Hybrid! zat ion" as used herein refers to a process in 
which a strand of nucleic acid joins with a complementary 
strand through base pairing. The conditions employed in the 
10 hybridisation of two non- identical, but very similar, 
complementary nucleic acids varies with the degree of 
complementarity of the two strands and the length of the 
strands. Such techniques and conditions are well known to 
practitioners in this field. 
15 "Isolated amino acid sequence" refers to any amino acid 

sequence, however, constructed or synthesised, which is 
locationally distinct from the naturally occurring sequence. 

"Isolated DMA compound" refers to any BJSSA sequence, 
however constructed or synthesized, which is locationally 
20 distinct from its natural location in genomic DJSA. 

"Isolated nucleic acid compound'" refers to any RHA or 
DHA sequence, however constructed or synthesized, which is 
locationally distinct from its natural location. 

"Primer" refers to a nucleic acid fragment which 
25 functions as an initiating substrate for enzymatic or 
synthetic elongation. 

"Promoter" refers to a DNA sequence which directs 
transcription of DMA to 33HA. 

"Probe" refers to a nucleic acid compound or a 
3 0 fragment, thereof, which hybridizes with another nucleic 
acid compound . 

"Stringency" of hybridization reactions is readily 
, determinable by one of ordinary skill in the art, and 

generally is an empirical calculation dependent upon probe 
35 length, washing temperature, and salt concentration. In 
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general, longer probes require higher temperatures for proper 
annealing, while short probes need lower temperatures . 
Hybrid.i2e.ti0n generally depends on the ability of denatured 
DMA to re-anneal when complementary strands are present in an 
5 environment below their melting temperature. The higher the 
degree of desired .homology between the probe and hybrid! sable 

i.t a that can foe 
used. As a result, it follows that higher relative 
temperatures would tend to make the reactions more stringent, 

10 while lower temperatures less so. For additional details and 
explanation of stringency of hybridisation reactions, see 
Ausubel et al. , Current Protocols in Molecular Biology I Wiley 
Inters cience Publishers, 1995. 

* Stringent conditions" or "high stringency conditions", 

15 as defined herein, may -be identified by those that (1) employ 
low ionic strength and high temperature for washing, for 
example, 15 xnM sodium chloride/1.5 mM sodium citrate/0.1% 
sodium dodecyl sulfate at 50°C; (2) employ during 
hybridization a denaturing agent, such as formamide, for 

20 example, 50% (v/v) forraamide with 0.1% bovine serum 

albumin/0.1% fieoll/0.1% polyvinylpyrrolidone/50 ixM sodium 
phosphate buffer at pH 6.5 with 750 mM sodium chloride/75 mM 
sodium citrate at 42 °C; or (3) employ 50% formamide, 5X SSC 
{750 mK sodium chloride., 75 mM sodium citrate), 50 niM sodium 

25 phosphate <pH 6.8), 0.1% sodium pyrophosphate, 5X Denhardt's 
solution, sonicated salmon sperm DMA (50 fig /ml ) , 0 .1% SDS, and 
10% dextran sulfate at 42 °C with washes at 4 2°C in 0.2X SSC 
{30 mM sodium chloride/3 mM sodium citrate) and 50% formamide 
at 55°C, followed by a high-stringency wash consisting of 0 . IX 

30 SSC containing EDTA at 55»C. 

"Moderately stringent conditions" may be identified as 
described by Sambroofc et al. [Molecular Cloning; A Laboratory 
Manual, New York: Cold Spring Harbor Press, {1989)3, and 
include the use of washing solution and hybridisation 

35 conditions (e.g., temperature, ionic strength, and %SDS) less 
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stringent than those described above. An example of 
moderately stringent conditions is overnight incubation at 
3?°C in a solution comprising: 20% formamide, 5X SSC (750 mM 
sodium chloride, 75 vtM. sodium citrate) , 50 nM sodium phosphate 
5 at pH 7,6, 5X Denhardt's solution, 10% dextran sulfate, and 20 
mg/mL denatured sheared salmon sperm DNA, followed by washing 
the filters in IX SSC at about 37~5G°C. The skilled artisan 
will recognize how to adjust the temperature, ionic strength, 
etc . f as necessary to accommodate factors such as probe length 

"10 and the like. 

"PGR" refers to the x^idely- known polymerase chain 
reaction employing a thermally- stable DNA polymerase, 

"Leader sequence" refers to a sequence of amino acids 
which can be enKymatically or chemically removed to produce 

15 the desired polypeptide of interest. 

^Secretion signal sequence" refers to a sequence of 
amino acids generally present at the K- terminal region of a 
larger polypeptide functioning to initiate association of 
that polypeptide with the cell membrane compartments like 

20 endoplasmic reticulum and secretion of that polypeptide 
through the plasma membrane. 

Construc tion of DNA encoding the heterologous fusion 
proteins of the pres en t i nvention : 

25 wild- type albumin and Immunoglobulin proteins can foe 

obtained from a variety of sources . For example, these 
proteins can be obtained from a cDNA library prepared from 
tissue or cells which express the mKKA of interest at a 
detectable level. Libraries can be screened with probes 

30 designed using the published DNA or protein sequence for the 
particular protein of interest. For example,, immunoglobulin 
light or heavy chain constant regions are described in 
Adams, et al . (1980) Biochemistry 19:2711-2719; Goughet, et 
al. (1980) Biochemistry 19:2702-2710; Dolby, et al. (1980) 

35 Proc, Natl. Acad. Sci. USA 77:6027-6031; Rice et al, (1982) 
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Proc. Natl. Acad. Sci . USA 79 : 7862-7862 ? Falkner, et al. 
(1982) .Nature 298:286-288; and Morrison, et al. (1984) Ami. 
Rev, Immunol . 2; 239-256. Some references disclosing albumin 
protein and BN& sequences include Meloun, et al. i 1975 > FEES 
5 Letters 58:136; Behrens , et al. {1975} Fed. Proc, 34:591; 
Lawn, al, (1981) Mxcieic Acids Research 9:6102-6114; and 
Minghetti, et al . (1986) J* Biol, Chem. 261:6747 

Screening a cDK'A or genomic library with the selected 
probe may be conducted using standard procedures, such as 

10 described in Sambrook et al., Molecular Cloning: A 

Laboratory Manual, Cold Spring Harbor Laboratory Press, HY 
(1989) . An alternative means to isolate a gene encoding an 
albumin or immunoglobulin protein is to use PGR methodology 
[Sambrook et al. , supra; Dieffenbach et al. , PCR Primer: A 

15 Laboratory Manual, Cold Spring Harbor Laboratory Press, NY 
{1995) 3 . PCR primers can be designed based on published 
sequences . 

Generally the full-length wild-type sequences cloned 
from a particular species can serve as a template to create 

20 analogs, fragments, and derivatives that retain the ability 
to confer a longer plasma half- life on the GLP-1 compound 
that is part of the fusion protein. It is preferred that 
the Fc and albumin portions of the heterologous fusion 
proteins of the present invention be derived from the native 

25. human sequence in order to reduce the risk of potential 
immunogenic! ty of the fusion protein in humans. 

In particular, it is preferred that the immunoglobulin 
portion of a fusion protein encompassed by the present 
invention contain only an Fc fragment of the iirsmonoglobulin . 

30 Depending on whether particular effect or functions are 
desired and the structural characteristics of the fusion 
protein, an Fc fragment may contain the hinge region along 
with the CH2 and CH3 domains or some other combination 
thereof. These Fc fragments can be generated using PCR 

35 techniques with primers designed to hybridise to sequences 
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corresponding to the desired ends of the fragment. 
Similarly, if fragments of albumin are desired, PGR primers 
can be designed which are complementary to internal albumin 
sequences. PGR primers can also "be designed to create 
5 restriction enzyme sites to facilitate cloning into 
f ass I d i vectors 

DMA encoding the GLP-1 compounds of the present 
invention can be made by a variety of different methods 
including cloning methods like those described above as well 

10 as chemically synthesized DNA. Chemical synthesis may be 
attractive given the short length of the encoded peptide. 
The amino acid sequence for GLP-1 has been published as well 
as the sequence of the preproglucagon gene. [Lopez , at al. 
{1983) Proc. Natl. Acad. Sci., USA 80:5485-5485; Bell, et 

15 al. {1983} Nature, 302:716-718; Heinrich, (3., et al. {1984} 
Endocrinol, 115:2176-2131; Ghiglione, M. , efc al. 91984) 
Diabefeologia 27:599-600] . Thus, primers can be designed to 
PGR native GLP-1 compounds and fragments thereof. 

The gene encoding a fusion protein can then be 

20 constructed by iigating DMA encoding a GLP-1 compound in- 
frame to DNA encoding an albumin or Fc protein. The gene 
encoding the GLP-1 compound and the gene encoding the 
albumin or Fc protein can also be joined in- frame via DMA 
encoding a linker peptide. 

25 , The in vivo function and stability of the heterologous 

fusion proteins of the present invention can be optimized by 
adding small peptide linkers to prevent potentially unwanted 
domain interactions. Although these linkers can potentially 
be any length and consist of any combination of amino acids,. 

30 it is preferred that the length be no longer than necessary 
to prevent ^unwanted domain interactions and/ or optimize 
biological activity and/or stability. Generally, the 
linkers should not contain amino acids with extremely bulky- 
side chains or amino acids likely to introduce significant 

35 secondary structure. it is preferred that the linker be 
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serine -glycine rich and be less than 30 amino acids in 
length. It is more preferred that the linker be no more 
than 20 amino acids in length. It is even more preferred 
that the linker be no more than 15 amino acids in length. A 
5 preferred linker contains repeats of the sequence Gly-Gly- 
Gly-Gly-Ser. It is preferred that there be between 2 and 6 
repeats of this sequence. It is even more preferred that 
there be between 3 and 4 repeats of this sequence. 

The DNA encoding wild-type GLP-1, albumin, and Fc 

10 polypeptides and fragments thereof can he mutated either 
before ligation or in the context of a cDNA encoding an 
entire fusion protein. A variety of mutagenesis techniques 
are well known in the art. For example, a mutagenic PCR. 
method utilises strand overlap extension to create specific 

15 base mutations for the purposes of changing a specific amino 
acid sequence in the corresponding protein. This PCR 
mutagenesis requires the use of four primers, two in the 
forward orientation {primers A and C) and two in the reverse 
orientation {primers B and D) . A mutated gene is amplified 

20 from the wild-type template in two different stages. The 

first reaction amplifies the gene in halves by performing an 
A to B reaction and a separate C to D reaction wherein the B 
and C primers target the area of the gene to be mutated. 
When aligning these primers with the target area, they 

25 contain mismatches for the bases that are targeted to be 

changed. Once the A to B and C to D reactions are complete, 
the reaction products are isolated and mixed for use as the 
template for the A to D reaction. This reaction then yields 
the fuil ; mutated product. 

30 Once a gene encoding an entire fusion protein is 

produced it can be cloned into an appropriate expression 
vector. Specific strategies that can be employed to make 
the GLP-1 fusion proteins of the present invention are 
described in example 1. 
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Ge neral methods to recombinantly express t he heterol ogous 
'j, ><i < r • - J Ll r AS - { - 2 - 0 U,i 

Host cells are transfected or transformed with expression 
or cloning vectors described herein for heterologous fusion 
5 protein production and cultured in conventional nutrient media 
modified as appropriate for inducing promoters, selecting 
transf ormanfcs, or- anplifying the genes encoding the desired 
sequences. The culture conditions, such as media, temperature, 
pH and the like, can be selected by the skilled artisan without 

10 undue experimentation . In general, principles, protocols, and 
practical techniques for iBaximizing the productivity of cell 
cultures can be found in Mammalian Cell Biotechnology: A 
Practical Approach, M. Butler, ed. (IRL Press, 1.99.1 J and 
Sambrook , at al., supra. Methods of trans feet ion are known to 

15 the ordinarily skilled artisan, for example, CaP0<s and 

electroporation. General aspects of mammalian cell host system 
transformations have been described in U.S. Patent Ho. 
4,399,216. Transformations into yeast are typically carried 
out according to the method of van Solingen et al., <T Bacfc. 

20 130(2) : 945-7 (1977) and Hsiao et al., Proc. Natl. Acad. Set. 
USA 7 5(8) : 3829-33 (1979), However, other methods for 
introducing DNA into ceils, such as by nuclear microinjection, 
electroporation, bacterial protoplast fusion with intact cells, 
or poiycations, e.g., polyforene or polyomi thine, may also be 

25 used. For various techniques for transforming mammalian cells, 
see Keown, et al., Methods in Ensymology 185: 527-37 (1990) and 
Kansour, ec al., Nature 336(6197) : 348-52 (1988). 

Suitable host cells for cloning or expressing the nucleic 
acid {e.g., DNA} in the vectors herein include prokaxyote, 

30 yeast, or higher eukaryote cells. Suitable prokaryotes 
include but are not limited to eubacteria, such as Gram- 
negative or Gram-positive organisms, for example, 
Hlnterobacteriacea such as E. coli. Various B. coli strains 
are publicly available, such as E. coli Ki2 strain I#I294 (ATCC 

35 3 1.448); B. coli XI 776 (ATCC 3 1.537); E. coli strain W3 110 
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{ATCC 27.32 5) and K5 772 {ATCC 53.635}. Other suitable 
proJcaryotic host cells include Enterobaetariaceae such as 
Escherichia, e.g.. E. coli, Bnterobacter , Erwinia, Klebiselia, 
Proteus, Salmonella, e.g., Salmonella typhimuriuxa, Serratia, 
5 e.g., Serratia itsarcescans , and Shigella, as well as Bacilli 
such as B. subtil is and B. lichentf ormis (e.g., B. 
lichenir ormis 4 1 ? disclosed in DD266, 7 10, published 13 
April 1989), Pseudomonas such as p. aeruginosa, and 
Streptomyces. These examples are illustrative rather than 

10 . limiting. Strain W3110 is one particularly preferred host or 
parent host because it is a coiBtaon host strain for recombinant 
DMA product fermentations. Preferably, the host cell secretes 
minimal amounts of proteolytic ensymes. For example, strain 
W3 110 may be modified to effect a genetic mutation in the 

15 genes encoding proteins endogenous to the host, with examples 
of such hosts including E. coli W3110 strain 1A2, which has 
the complete genotype ronA; E. coli W5 110 strain 9E4, which 
has the complete genotype ton4 ptr3; E . coli W3110 strain 27C7 
(ATCC 55,244}, which has the complete genotype tonA, ptr3 phoA 

20 E15 (argF-lac) 169 degP ompT /can'; E. coli W3110 strain 40B4, 
which is strain 37B6 with a non-kanamycin resistant degP 
deletion mutation; and an E. coli strain having mutant 
periplasm! c protease disclosed in U.S. Patent No. 4,946,783 
issued 7 August 1390. Alternatively, in vivo methods of 

25 cloning, e.g., PCR or other nucleic acid polymerase reactions, 
are suitable. 

In addition to prokaryotes, eukaryotie microbes such as 
filamentous fungi or yeast are suitable cloning or expression 
hosts for fusion protein vectors. Saeeharomyces cerevisiae is 

30 a commonly used lower eukaryotic host microorganism. Others 
include Sehizosaecharomyces pombe [Beach and Nurse, Nature 
290: 140-3 {1981}; EP 139,383 published 2 May 13953; 
Muyveromyces hosts [U.S. Patent Ho. 4,943, 529; Fleer, et al. , 
Bio/Technology 9 (10) : 968-75 (1991)3 such as, e.g., K lactis 

35 (MW98-SG, CBS 6 S3 , CBS4574) fde Bouveneourt et al., J. 
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Bacterial. 154(2): 737-42 {1983)3; K. fiagilis { ATCC 12,424},, 
K. bulgaricus (ATCC 16,045), K wickeramii (ATCC 24,178}, K 
waltii {ATCC 56,500) , K. drosophilarum (ATCC 36.906) [Van den 
Berg et al., Bio/Technology B{2}t 135-3 {1990)1; K. 
5 thermotoierans, and K« marxianus; yarrowia {EP 402,226); 
Pichia pas tor is {EP 183,070} [Sreekrishna et al., J. Basic 
Microbiol, 28(4): 265-73 (1988)1; Candid; Trichodarma rsesia 
{BP 244,234); Neurospora crassa [Case, et al. , Proc, Natl, 
Acad Sci. USA 76(10): 5259-63 (1979)3; Sc^armiomyces such as 
10 Schwann! cmyces occidentulis {EP 394,538 published 31 October 

1990) ; and filamentous fungi such as, e.g., Neurospora, 
Penicillium, Tolypoc ladium (WO 91/00357 published 10 January 

1991) , and Aspergillus hosts such as A. nidulans [Ballance et 
al., Biochem, Biophys . Res. Comm. 112(1): 284-9 £1983)3; 

15 Tiiburn, et al., Gene 26(2-3): 205-21 (1983) ; Ye 1 ton, et al., 
Proc. mtl. Acad . Sci. USA 81 {5}: 1470-4 (1984)3 and A. nigar 
[Kelly and Hynes, EMBO J. 4(2): 475-9 (1985)]. Kethylotropic 
yeasts are selected from the genera consisting of Hansenula, 
Candida, Kloeckera, Pichia, Sacciiarontyces , Torulopsis, and 

20 Rhodotoraia. A list of specific species that are exemplary of 
this class of yeast may be found in C. Antony, The 
Biochemistzy of Methylotrophs 269 (1982) . 

Suitable host cells for the expression of the fusion 
proteins of the present invention are derived from 

25 multicellular organisms. Examples of invertebrate cells 

include insect ceils such as Drosophila S2 and Spodoptera Sp, 
Spodoptera highS as well as plant cells. Examples of useful 
maiamalian host cell lines include Chinese hamster ovary (CHO) 
and COS cells . More specific examples include monkey kidney 

30 CVI line transformed by SV40 (COS-7, ATCC CRL 1651); human 

embryonic kidney line [293 or 293 ceils subcioned for growth 
in suspension culture, Graham, et al., J . Gen Virol., 36(1): 
59-74 (1977)3; Chinese hamster ovary eells/-DHFR [CHO, Urlaub 
and chasin, Proc Natl. Acad. Sci. USA, 77(7): 4216-20 

35 {1980}]; mouse Sertoli ceils [TM4, Mather, Biol. Reprod. 
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23 (X): 243-52 {1980)]; human lung cells {W138 . ATCC CGL 75); 
human liver cells (Hep G2, HB 8065) ? and mouse mammary tumor 
{KMT 060562, ATCC CCL51) , The selection of the appropriate 
host cell is deemed to be within the skill in the art. 
5 The fusion proteins of the present invention may be 

recombinant ly produced directly, or as a protein having a 
signal sequence or other additional sequences which create a 
specific cleavage site at the N- terminus of the mature fusion 
protein. In general, the signal sequence may be a component 

10 of the vector, or it may be a part of the fusion protein- 
encoding DMA that is inserted into the vector. The signal 
sequence may be a prokaryotic signal sequence selected, for 
example, from the group of the alkaline phosphatase, 
penicillinase, ipp, or heat-stable enterotoxin II leaders. 

15 For yeast secretion the signal sequence may be,, e.g. , the 
yeast invertase leader, alpha factor leader {including 
Sac char omyces and Kluyveromyces cc~ factor leaders, the latter 
described in U.S. Patent No. 5,010,182), or acid phosphatase 
leader, the C. albicans gluco amylase leader (BP 362,179}, or 

20 the signal described in WO 90/13646. In mammalian cell 

expression, mammalian signal sequences may be used to direct 
secretion of the protein, such as signal sequences from 
secreted polypeptides of the same or related species as well 
as viral secretory leaders . 

25 Both expression and cloning vectors contain a nucleic 

acid sequence that enables the vector to replicate in one or 
more selected host cells. Such sequences are well known for a 
variety of bacteria., yeast, and viruses. The origin of 
replication from the plasmid pBR322 is suitable for most Cram- 

30 negative .bacteria, the 2u plasmid origin is suitable for 

yeast, and various viral origins £SV4Q, polyoma, adenovirus, 
VSV or BFV) are useful for cloning vectors in mammalian cells. 

Expression and cloning vectors will typically contain a 
selection gene, also termed a selectable .marker. Typical 

35 selection genes encode proteins that (a) confer resistance to 
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antibiotics or other toxins, e.g., ampicillin, neomycin, 
methotrexate, or tetracycline, (b) complement autotrophic 
deficiencies, or (c) supply critical nutrients not available 
from complex media, e.g., the gene encoding D-alanine racemase 
5 for Bacilli . 

An example of suitable selectable markers for mammalian 
ceils are those that enable the identification of cells 
competent to take up the fusion pro he in- encoding nucleic acid, 
such as DHFR or thymidine kinase. An appropriate host cell 

10 when wild- type DHFK is employed is the CHO ceil line .deficient 
in DHFR activity, prepared and propagated as described [Url&ub 
and Chasin, Proc. Natl. Acad. Sex. USA r 77(7): 4216-20 
{1980}]. A suitable selection gene for use in yeast is the 
trpl gene present in the yeast plasmid YRp7 [Stinchcorah, et 

15 al., Nature 282 (5734) : 39-43 (1979); Kingsxnan, et al . , Gene 
7 {2); 141-52 (1979); Tschuraper, et al., Gene 10(2}; 157-66 
(1980)3. The trpl gene provides a selection marker for a 
mutant strain of yeast lacking the ability to grow in 
tryptophan, for example, ATCC No. 44076 or PBPC1 [Jones, 

20 Genetics 85 : 23-33 (1977)]. 

Expression and cloning vectors usually contain a promoter 
operably linked to the fusion protein-encoding nucleic acid 
sequence to direct irtRKJA synthesis. Promoters recognized by a 
variety of potential host cells are well known. Promoters 

25 suitable for use with prokaryotic hosts include the f$ 

lactamase and lactose promoter systems [Chang, et al., Nature 
275(5581); 617-24 (1978); Goeddel, et al., Nature 281(5732} : 
5<?4~8 (197.9)].. alkaline phosphatase, a tryptophan {up) 
promoter system [Goeddel, Nucleic Acids Res. 8(18): 4057-74 

30 (1980); EP 3 6,776 published 30 September 1981], and hybrid 
promoters such as the tat promoter [deBoer, et al., Proc. 
Natl. Acad. Sci. USA 80(1): 21-5 {1983)3. Promoters for use 
in bacterial systems also will contain a Shine-Dalgamo (S.D. ) 
sequence operably linked to the UNA encoding the fusion 

3 5 protein . 
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Examples .of suitable promoting sequences for use with 
yeast hosts include the promoters for 3 -phosphogly cerate 
kinase [Hitzeman, et al., J. Biol, Chem. 255(24} : 12073-80 
(1980)] or other glycolytic enzymes [Hess et al,, J. Adv. 
5 Enzyme Reg. 7: 149 {1968}; Holland, Biochemistry 17(23) : 4900- 
7 (1978)].. such as enolase, glyceraidehyde- 3 -phosphate 
dehydrogenase, hexokinase, pyruvate decarboxylase., 
pliosphofruccokinase, glueose~$~phosphate isoioerase, 3~ 
phosphoglycerate mutase, pyruvate kinase, trios 1 , 

10 i some-rase, phosphoglxicose isomerase, and glucokinase. 

Other yeast promoters, which are inducible promoters 
having the additional advantage of transcription controlled by 
growth conditions, are the promoter regions for alcohol 
dehydrogenase 2, i socy t ochrome C, acid phosphatase, 

15 degradative ensymes associated with nitrogen metabolism, 

metallothionein, glyceraldehyde-3-phosphate dehydrogenase, and 
enzymes responsible for maltose and galactose utilization. 
Suitable vectors and promoters for use in yeast expression are 
further described in EP 73,657. Transcription of fusion 

20 protein-encoding mRKA from vectors in mammalian host cells may 
be controlled, for example, by promoters obtained from the 
genomes of viruses such as polyoma virus, fowipox virus, 
adenovirus (such as Adenovirus 2) , bovine papilloma virus, 
avian sarcoma virus, cytomegalovirus, a retrovirus, hepatitis- 

25 B virus and Simian Virus 40 (SV40) , from heterologous 
mammalian promoters, e.g., the actio promoter or an 
irennnoolobuliri promoter, and from heat-shock promoters, 
provided such promoters are compatible with the host cell 
systems . 

30 Transcription of a polynucleotide encoding a fusion 

protein by higher eukaryotes may be increased by inserting an 
enhancer sequence into the vector. Enhancers are cis- acting 
elements of DNA, usually about from 10 to 300 bp, that act on 
a promoter to increase its transcription. Many enhancer 

35 sequences are now known from mammalian genes (globin. 
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elastase, albumin, a-ketoprotein , and insulin) . Typically, 
however, one will use an enhancer from a eukaryotic cell 
virus. Examples include the SV40 enhancer on the late side of 
the replication origin {bp 100-270} , the cytomegalovirus early 
5 promoter enhancer, the polyoma enhancer on the late side of 
the replication origin, and adenovirus enhancers . The 
enhancer may be spliced into the vector at a position 5 f or 3' 
to the fusion protein coding sequence but is preferably 
located at a site 5' from the promoter. 

10 Expression vectors used in eukaryotic host cells (yeast, 

fungi., insect, plant, animal, human, or nucleated cells from 
other multicellular organisms) will also contain sequences 
necessary for the termination of transcription and for 
stabilizing the xnJSSJA. Such sequences are commonly available 

15 from the 5' and occasionally 3' untranslated regions of 

eukaryotic or viral DNAs or eDNAs. These regions contain 
nucleotide segments transcribed as polyadenylated fragments in 
the untranslated portion of the irtRNA encoding the fusion 
protein 

20 Various forms of a fusion protein may be recovered from 

culture medium or from host cell ly sates . if membrane-bound, 
it. car; be released from the membrane using a suitable 
detergent solution (e.g., Triton-X 100) or by enzymatic 
cleavage. Cells employed in expression of a fusion protein 

25 can be disrupted by various physical or chemical means, such 
as freeze- thaw cycling, sonication, mechanical disruption, or 
cell lysing agents. 

^ - 1 1"" „<• '1 " -_<=: r -> 5 _y of tae 

3 0 present invention : 

Once the heterologous fusion proteins of the present 
invention are expressed in the appropriate host cell, the 
analogs can be isolated and purified. The following 
procedures are exemplary of suitable purification procedures: 

35 fractionation on carboxymethyi cellulose; gel filtration such 
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as Sephadex G-75? anion exchange resin such as DEAE or Mono-Q; 
cation exchange such as CM or Mono-S; protein A sepharose to 
remove contaminants such as IgG; metal chelating columns to 
bind epitope- tagged forms of the polypeptide; reversed-phase 
5 HPLC; chroma tor ocxising ; silica gel; etha.no! precipitation ; and 
ammonium sulfate precipi tat ion . 

Various methods of protein, purification may be employed 
and such methods are known in the art and described , for 
example, in Beutscher, Methods In Ezizymology 182: 83-9 (1990) 

10 and Scopes, Protein Purification: Principles and Practice, 

Spr inger-Ver lag , NY (1982). The purification sfcep(s) selected 
will depend on the nature of the production process used and 
the particular fusion protein produced. For example, fusion 
proteins comprising an Fc fragment can be effectively purified 

15 using a Protein A or Protein G affinity matix. Low or high pH 
buffers can be used to elute the fusion protein from the 
affinity matrix. Mild elution conditions will aid in 
preventing irreversible denaturation of the fusion protein. 
Imidazole- containing buffers can also be used. Example 3 

20 describes some successful purification protocols for the 
fusion proteins of the present invention. 

z: o j- 'zap V- , 1 < _ - -t ains of the 



25 Numerous methods exist to characterize the fusion 

proteins of the present invention. Some of these methods 
include : SOS-PAGE coupled with protein staining methods or 
immunoblotting using anti-IgG or anti-HSA antibodies. Other 
methods include matrix assisted laser desporption/ ionization- 

30 mass spectrometry (MALDI-MS) « liquid chromatography/mass 

spectrometry, isoelectric focusing, analytical anion exchange, 
chromato focussing , and circular dichroism to name a few. A 
representative number of heterologous fusion proteins were 
characterised using SDS PAGE coupled with immunoblotting as 
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well as mass spectrometry (See examples 4 and 5 and figures 3 
and 45 . 

For example table 3 (see example 5} illustrates the 
calculated molecular mass for a representative number of 
5 fusion proteins as well as the mass as determined by mass 
spectrometry. In addition, Figures 3 and 4 illustrate 
molecular weights of a representative number of fusion 
proteins as determined by SDS PAGE. All heterologous fusion 
proteins tested were expressed and secreted transiently. in 

10 addition., the IgK. signal sequence was cleaved to yield 
proteins with the correct N- terminus. 

Further, table 3 illustrates that in some instances the 
mass determined by mass spectrometry is greater than expected. 
This is the result of glyecsylation of the Fc portion and the 

15 C terminal extension. Enzymatic digestion of the fusion 

proteins followed by reversed-phase HPLC and mass spectrometry 
can identify peptide fractions that contain sugar moieties. 
These fractions can then foe ^-terminal amino acid sequenced to 
identify the potential glyecsylation site. For example, 

20 characterization of Exendin-4-Fc (SEQ ID NO: 29} shows that 
the serine at position 39 and threonine at position 50 are 0- 
1 inked glycosylated and the asaparagine at position 122 is M- 
1 inked glycosylated. 

A representative number of GLP-l fusion proteins were 

25 also tested for activity. Numerous methods exist to detect 

GLP-l activity in vitro and in vivo (see examples 6, 7, 8, and 
9). Table 4 (example 6) illustrates GLP-l receptor activity 
associated with several GLP-l fusions. The numbers are 
relative to the activity associated with Val 8 -GLP- 1 (7-37 }OH. 

30 All fusion proteins tested had GLP-l receptor activity. A low 
level of in vitro activity is not necessarily indicative of a 
weak effect in vivo. Because of the substantial increase in 
the half- life of these fusion proteins, weak in vitro activity 
is not generally a predictor of weak in vivo activity. Figure 

35 7 and example 7 illustrate the prolonged half- life associated 
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with the fusion proteins of the present invention. For 

ex&mole, Val " ~GLF- 1 -Fc had a half -life of approximately 45 
' 3 

hours in monkeys, Val -GLF-1-HSA had a half™ life of about 87 
hours in monkeys, Giy 8 -Glu 22 -GLP-l-CEx-Idnker-IgGl had a half™ 
5 life after IV administration of approximately 55 hours in 
dogs, and Gly 8 -Glu 22 -GLP-l-CEx~Linker-IgGl had a half -life 
after SC administration of approximately 38 hours in dogs. 

C ompos i tions of J^h^Jjn^R^^:^ 

10 Physical stability is also an essential feature for 

therapeutic protein formulations. GLP-1 compounds have been 
especially difficult to manufacture and formulate due to 
structural changes that occur during processing. For example, 
some GLP-1 compounds have a general tendency to aggregate. In 

15 addition, it has been shown that some GLP-1 compounds convert 
from a soluble and active a-helix form to an insoluble and 
potentially inactive fi- sheet form. The £ vision of GLP-1 
compounds to large proteins such as the Fc region of an IgG or 
albumin not only acts to increase the half -life of the GLP-1 

20 compound, but also contributes to the physical and 

conformational stability of the GLP-1 compound. For example, 
Val 8 -GLP-l-Linker~HSA in PBS is stable at 37°C out to about 30 
day s . 

The heterologous fusion proteins of the present invention 
25 may be formulated with one or more excipients. The active 

fusion proteins of the present invention may be combined with 
a pharmaceutical ly acceptable buffer, and the pH adjusted to 
provide acceptable stability, and a p.K acceptable for 
administration such as parenteral administration. 
30 Optionally, one or more pharmaceutically-acceptable anti- 

microbial agents may be added. Meta-cresol and phenol are 
preferred pharraaeeutically-acceptable microbial agents. One 
or more pharmaceutically-acceptable salts may be added to 
adjust the ionic strength or tonicity. One or more excipients 
35 may be added to further adjust the isotonic! ty of the 
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formulation. Glycerin is an example of an isotonicicy- 
adj usting excipient. Pharmaceutical!}' acceptable means 
suitable for administration to a human or other animal and 
thus, does not contain toxic elements or undesirable 
S contaminants and does not interfere with the activity of the 
active compounds therein. 

A pharmacauticaily-aeeeptable salt form of the 
heterologous fusion proteins of the present invention may be 
used in the present invention. Acids commonly employed to 

10 form acid addition salts are inorganic acids such as 

hydrochloric acid, hydrobromic acid, hydriodic acid, sulfuric 
acid, phosphoric acid, and the like, and organic acids such as 
p-toluenesulfonic acid, methanesulfonic acid, oxalic acid, p~ 
bromophenyl -sulfonic acid, carbonic acid, succinic acid, 

15 citric acid, benzoic acid, acetic acid, and the like. 

Preferred acid addition salts are those formed with mineral 
acids such as hydrochloric acid and hydrobromic acid. 

Base addition salts include those derived from inorganic 
bases, such as ammonium or alkali or alkaline earth metal 

20 hydroxides, carbonates, bicarbonates , and the like. Such 
bases useful in preparing the salts of this invention thus 
include sodium hydroxide, potassium hydroxide, ammonium 
hydroxide, potassium carbonate, and the like. 

25 ± sifci "to 

Administration may be via any route known to be effective 
by the physician f >r dinars fjci.; Pe pbera'i are era] is 
one such method. Parenteral administration is commonly 
understood in the medical literature as the injection of a 

30 dosage form into the body by a sterile syringe or some other 
mechanical device such as an infusion pump. Peripheral 
parenteral routes can include intravenous, intramuscular, 
subcutaneous , and intraperitoneal routes of administration . 

The heterologous fusion proteins of the present invention 

35 may also be amenable to administration by oral, rectal, nasal, 
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or lower respiratory routes, which are non-parent era 1 routes. 
Of these non-paren t eral routes, the lower respiratory route 
and the oral route are preferred. 

The fusion proteins of the present invention can be used 
5 to treat a wide variety of diseases and conditions. The 

fusion proteins of the present invention primarily exert their 
biological affects by acting at a receptor referred to as the 
VX GLP-1 receptor." Subjects with diseases and/or conditions 
that respond favorably to GLP-1 receptor stimulation or to the 

10 administration of GLP-1 compounds can therefore be treated 
with the GLP-1 fusion proteins of the present invention. 
These subjects are said to w be in need of treatment with GLP-1 
compounds" or "in need of GLP-1 receptor stimulation". 
Included are subjects with non- insulin dependent diabetes, 

15 insulin dependent diabetes, stroke {see WO 00/1679?) f 

myocardial infarction {see WO 98/08531), obesity (see WO 
98/19698), eatsbolie changes after surgery (see U.S. Patent 
2Qo. 6,006,753), functional dyspepsia and irritable bowel 
syndrome (see WO 99/64060) . Also included are subjects 

20 requiring prophylactic treatment with a GLP-1 compound, e.g., 
subjects at risk for developing non-insulin dependent diabetes 
{see WO 00/07617) . Subjects with impaired glucose tolerance 
or impaired fasting glucose, subjects whose body weight is 
about 25% above normal body weight for the subject's height 

25 and "body build, subjects with a partial pancreatectomy, 

subjects having one or more parents -with non- insulin, dependent 
diabetes, subjects who have had gestational diabetes and 
subjects who have, had acute or chronic pancreatitis are at 
risk for developing non-insulin dependent diabetes. 

30 An "effective amount" of a GLP-1 compound is the 

quantity which results in a desired therapeutic and/or 
prophylactic effect without causing unacceptable side- 
effects when administered to a subject in need of GLP-1 
receptor stimulation. A "desired therapeutic effect." 

35 includes one or more of the following: 15 an amelioration of 
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the symptom* s} associated with the disease or condition; 2} 
a delay in the onset of symptoms associated with the disease 
or condition; 3) increased longevi ty compared wifch the 
absence of the treatment; and 4} greater quality of life 
5 compared with the absence of the treatment. For example, an 
"effective amount'' of a GLP-1 compound fox: the treatment of 
diabetes is the quantity that would result in greater 
control of blood glucose concentration than in the absence 
of treatment, thereby resulting in a delay in the onset of 

10 diabetic complications such as retinopathy, neuropathy or 
kidney disease. An "effective amount" of a GLP-1 compound 
for the prevention of diabetes is the quantity that would 
delay, compared 'with the absence of treatment, the onset of 
elevated blood glucose levels that require treatment with 

15 ant i -hypogly caerai c drugs such as sul forty 1 ureas, 
thiazolidinediones, insulin and/ or bisguanidines . 

The dose of fusion protein effective to normalise a 
patient's blood glucose will depend on a number of factors, 
among which are included, without limitation, the subject's 

20 sex, weight and age, the severity of inability to regulate 
blood glucose, the route of administration and 
bioavailability, the pharmacokinetic profile of the fusion 
protein, the potency, and the formulation. 

The present invention comprises GLP-1 compounds that 

25 have improved biochemical and biophysical properties by 
virtue of being fused to an albumin protein, an albumin 
fragment, an albumin analog, a Pc protein, a pc fragment... or 
a Fc analog. These heterologous proteins can be 
successfully expressed in host cells, retain signaling 

30 activities associated with activation of the GLP-1 receptor, 
and have prolonged half -lives. 

The following examples are presented to further 
describe the present invention. The scope of the present 
invention is not to be construed as merely consisting of the 

35 following examples. Those skilled in the art will recognize 
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that the particular reagents, eqruipment, and procedures 
described are merely illustrative and are not intended to 
limit the present invention in any manner. 



5 EfciK- • ' , - Ion o[ MA 

en coding b g fc e r o 1 q g o us fusion p x o tci. ns 

= g 

Example la Construction of DMA encoding Val -GLP-I (7-37 } - 

Fc: A Fc portion of human IgGl was isolated from a 
cDNA library and contains the full hinge region and the CH2 

10 and CH3 domains. A fragment containing 696 base pairs of 
this Fc portion of human IgGl was subcloned into the Uhel 
and Bco47IlI sites of mammalian expression vector pJB02 to 
create pJB02/Fc (see Figure 5) . DNA encoding the IgK 
secretion signal sequence fused to Val -GLP-l (7 -37) was 

1 5 ger and 
complementary oligonucleotides : 



5 ' - CTAGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTGGGTT 
CCAGGTTCCACTGGTGACCAGTG - 3' [SEQ ID NO: 123 

20 

5 ' - GAGGGCACCOTCACCTCCGACGTGTCCTCCTATCTGGAGGGCCAGGCCGCCAAGGA 
GTTCATCGCCTGGCTGGTGAAGGGAAGAGGC - 3' [SBQ ID HO: 13 3 

5'- TGAAGGTGCCCTCCACGTGGTCACCAG'J?GGAACCTGC5AA.CCCAGAGCAGCAGTA 
25 CCCATAGCAGGAGTGTGTCTGTCTCCATGGTGG - 3' [SKQ ID 

NO: 14 3 



5 ' ■■■ GCCTCTTCCCTTCACCAGCCAGGCGATGAACTCCTTGGCGGCCTGGCCCTCCAGA 
TAGGAGGACACGTCGGAGG - 3' [SEQ ID NO: 153 

30 

The hybridization reaction was carried out using 
equivalent amounts of each oligonucleotide {1 pra/ul final 
concentration for each oligo) . The mixture of 
oligonucleotides was heated for 5 min at 100°C in ligation 
35 buffer (SO M Tris-HCl, pH 7.5, 10 DriM MgCl 2 , 10 mM DTT, ImM 
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ATP, 25 jig /ml bovine serum albumin) and then cooled over at 
least 2 hours to 30°C. 

The resulting hybridization product was li gated for 2 
hours at room temperature or overnight at 16°C to the 
5 pJB02/Fc vector backbone which had been digested with Nhel 
and Eco471II . The ligation products were used to transform 
competent XL-1 Blue cells (Stratagene) . Recombinant 
plasmids were screened for the presence of peptide coding 
inserts by digesting clones with Ncol {encoding the Kozak 
10 sequence and first Met of the signal peptide) and -sequenced. 
The resulting expression plasmid used for trans feet ion 
assays was denoted pJB02-V8-GLP-l-Fc (Figure 5} . 

Example lb Construction of DifA encoding Val 8 -GLP-1 .{7-37} ~ 

15 HSA: The plasmid HSA/pcDNA3 , 1GS was purchased from 

Invitrogen (Catalog $ H-M12523M-pCDNA3 . 1/GS} and used as a 
template to isolate the eDNA encoding human serum albumin 
(HSA} . The HSA cDMA was prepared using PGR wherein the DNA 
encoding the leader sequence as well as the six amino acid 

20 pro-peptide was removed from the 5' end. In addition, stop 
codons were added directly at the 3' end of the HSA coding 
sequence. Finally, restriction enzyme sites were engineered 
at the 5' and 3' end to facilitate cloning. The HSA DSSJA 
sequence present in the original vector purchased from 

25 Invitrogen contained a single base change in the 3' region 
of the gene {position 667) compared to the native human 
sequence. This change would result in a codon for Asn 
instead of Asp, Thus, using the strand overlapping PGR 
mutagenesis method discussed above, the codon was changed to 

30 code for Asp at this position. The resulting HSA encoding 
DHA was cloned into the A?heJ and Hindlll sites of pJB02 to 
create pJB02~HSA (Figure 6). 

3 

The IgK leader sequence fused to the Val -GLP-1 [7-37} 
sequence was generated as discussed in Example la. This DNA 
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was ligated into the ffliel and Fspl sites of pJB02-HSA to 
create pJB02- Val ~GLP~1-HSA. 

Example 1c Construction of DMA encoding Val 8 -GLP-1 {7-37 ) - 
5 l.inker-HSA: 

The vector -pJBQ2-HSA was prepared as discussed in 
Sxample lb. DNA encoding the linker sequence [GGGGSb was 
ligated in frame to the 5 ' end of the HSA encoding DNA to 
create pJB02 - linker-HSA (Figure 7} , DMA encoding the Igx 
10 leader sequence and fused to the Val -GLP-1 (7-37 ) sequence 
and the 5' part of the linker sequence was generated as 
discussed in Example la. This DNA was ligated into the Nhel 
and BspEI sites of pJB02 to create pJB02 - Val B -GLP -1 -linker- 
HSA. 

15 

Example Id Construction of DMA encoding Exendin-4~Fc : 

Ths plasmid pJB02/Fc was prepared as described in 
Example la. DNA encoding the IgK signal sequence fused to 
Bxending-4 was generated by in vitro hybridisation of the 
20 following overlapping and complementary oligonucleotides: 

5 ' - CTAGCCACCATGGAGACAGACACACTCCTGCTATGGGTACTGCTGCTCTG 
GGTTCCAGGTTCCACCGGTCAC - 3' [SEQ ID NO; 16] 

25 5" - CyQAGAGGGAACC'rTCACCAGCGACCTGAGCAAGCAGATGGAGGAGGAGGCCGT 
GAGACTG - 3' [SEQ ID NO: 17] 

5' - TTO^TCGAGTGGCTGAAGAACGGAGGACCAAGCAGCGGAGCCCCTCCTCCT 
AGC - 3' [SEQ ID HO: 18] 

30 

5 ' - GAACCTGGAACCCAGAGCAGCAGTACCCATAGCAGGAGTGTGTCTGTCTCCA 
TGGTGG - 3' [SEQ ID HO: 193 

5 ' - CTCCTCCTCCATCTGCTTGCTCAGGTCGCTGGTGAAGGTTCCCTCTCCGTGA 
35 CCGGTG - 3' [SEQ ID NO: 20] 
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5 ' G< TAG ;G JCTC '< C PG< l< GTC !TCC JTTC TTCAGCCACTCGAT 

OA AC AG TCI ' C A C G GC - 3' [S1Q ID NO: 21] 

The hybridization reaction was carried out as described 
5 in Example la. The hybridised product was ligated to the 
pa BO 2 vector which had been digested with Nhel arid Sco4 7XII 
as described in Example la to create pJB02 -Exendin-4 -Fc . 

le Cons tract ion of DNA encoding. Exendiix- 4-HSA : 
10 The plasmid pJB02-KSA was prepared as described in 

Example lb. DNA encoding the IgK signal sequence fused to 
Exending-4 was generated by in vitro hybridization of the 
same overlapping and complementary oligonucleotides 
described in Exaxflple Id. Hybridization reactions were also 
15 carried out as described above. DNA was cloned into unique 
Jtihel and Fspl sites in pJBD2~HSA to create pJB02~Sxendin-4- 
HSA. 

Example If Construction of DNA encoding Exendin-4-linker- 
20 HSA: 

The plasmid pJ302- linker -- HSA was constructed as 
described in Example lc DMA encoding the IgK signal 
sequence fused to Exendin-4 and the 5 ' part of the linker 

25 cloned into unique Nhel and BspEI sites in pJB02-linker-HSA 
to create pJBQ2 -Exendin-4- linker-HSA . 

Example lg Construction of DNA encoding Val 8 -GLP- 1 /C-Ex~Fc : 
The plasmid pJB02~Exendin--4-Fc was prepared as 
30 described in Example Id. The Exendin-4 encoding DBA was 
excised from the vector with Age! and Bco47XXI. The Val°- 
GLP-X/C-Ex encoding DNA was generated by in vitro 
hybridization of the following overlapping and • complementary 
oligonucleotides ; 



35 
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5' -CCGGTCACGTGGAGGGCACerrCACCTCCGACGTGTGCTCCTATCTGGA 
GGGCCAGGCCGCCA - 3' [SEQ ID MO: 22 3 

5 ' - AGGAATTCATCGCCTGGCTGGTGAAGGGCCGGGGCAGCAGCGG 
5 AGCCCCTCCTCCTAGC ~ 3' [SEQ ID NO: 23] 

5 ' - CTCGAGATAGGAGGACACGTCGGAGGTGAAGGTGCCCTCeAC 
GTGA - 3' [SEQ ID NO: 24] 

10 5' - 6CTAGGAGGAGGGGCTCCGCTGCTGCCCCGGCCCTTCACCAGCCAGGCGA 
TGAATTCCTTGGCGGCCTGGCC - 3' [SSQ ID NO: 25] 

The hybridisation, reaction was carried out as described 
in Example la. The hybridised product, was li gated in place 
15 of Exendin-4 in the pJB02--Exendin-4-Pc expression vector to 
create pJB02~Val 8 »GLP--l/C~Ex-Fc . 

Example lb Construction of DNA encoding Val 8 ~Glii 22 ~GLP-l-Fc: 

The plasmid pO'BQ2-Exendin~4-Fc was prepared as 

20 described in Example Id. The Bxendin-4 encoding DMA was 

excised from the vector with Acre! and Eco471II. The Val 8 - 
22 

Glu w "-GLP-1 encoding DNA was generated by in vitro 
hybridisation of the following overlapping and complementary 
oligonucleotides : 

25 

5' ~CCGGTCACGTGGAGGGCACCTTCACCTCCGACGTGTCGTCCTATCTCGA 
GGAGCAGGCCGCCA - 3' [SEQ ID HO: 26] 

5' - AGGAGTTCATCGCCTGGCTGGTGAAGGGCCGGGGC - 3 f [SEQ ID NO: 27] 

30 

5 ' - GCCCCGGCCCTTGACGAGCCAGGCGATGAACTCCTTGGCGGCC 
TGCTC - 3 f [SEQ ID NO: 28] 



5 ' - CTCGAGATAGGAGGACACGTCGGAGGTGAAGGTGCCCT 
35 CCACGTGA - 3' [SEQ ID NO: 29] 
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The hybridisation reaction was carried out as described 
in Example la. The hybridised product was ligafced in place 
of Exendin- 4 in the pO'E 0 2 - Exendin- 4 -Pc expression vector to 
5 create pJB02-Val°-Glu'" 2 -GL£-l-Fc. 

3 22 

Example Ix Construction of DMA encoding Val -Glu GhP~l/C- 
Ex-Pc : 

The piasmid pJB02 -Exendin- 4-Pc was prepared as 

10 described in Example Id. The Exendin- 4 encoding JMh was 

excised from the vector with &geX and BcoiJIII. The Val 8 ~ 
22 

Gln" w GLP~l/C~S.x encoding DMA was generated by in vitro 
hybridisation of the following overlapping and complementary 
oligonu cleo tides : 

15 

5 ' - CCGGTCACGTGGAGGGCACCTTCACCTCCGACGTGTCCTCC'TATCTCGA 
GGAGCAGGCCGCCA - 3' {SEQ ID NO: 3 0] 

5 ' - AGGAATTCATCGCCTGGCTGGTGAAGGGCCGGGGCAGCAGCGGA 
20 GCCCCTCCTCCTAGC ~ 3' [SEQ ID NO: 31] 

5 ' - CTCGAGATAGGAGGACACGTC'GGAGGTGAAGGTGCCC 
TCCACGTGA -3' [SEQ ID NO: 32] 

25 5' - GCTAGGAGGAGGGGCTCCGCTGCTGCCCCGGCCCTTCACCAGCCAGGCGA . 

TGAATTCCTTGGCGGCCTGCTC - 3' [SEQ ID SO: 33] 

The hybridization reaction was carried out as described 
in Example la. The hybridised product was legated in place 
3 0 of Exendin- 4 in the pJ50 2 -Exendin- 4-Fc expression vector to 
create pJB02-VaI 8 ~Glii 22 -GhP-l/C-Ex~Fc . 

Example Ij Construction of DHA encoding Gly 8 -GLP-1-Fc : 
The piasmid pJB 02 -Exendin- 4 -Fc was prepared as 
35 described in Example Id. The Exendin- 4 encoding DHA was 
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excised from the vector with Ag&l and Eco47lII. The Gly 8 - 
GLP-1 encoding DNA was generated by in vitro hybridization 
of the following overlapping and complementary 
oligonucleotides : 

5 ' - CCGGT'CACGGCGAGGGCACCTTCACTAGTGACGTGTCCTCCTATCTGQA 
GGGG C AGGCC6CCA - 3' [SEQ ID NO: 34] 

5' - AGG&GTTCATCGCCTGGCTQGTGAAGGGCCGGGGC - 3' [SBQ ID HO: 35 3 

5 ' - CTCCAGATAGGAGGACACGTCACTAGTGAAGGTGCCCTC 
GCCGTGA - 3' [SEQ ID NO:36] 

5 ' ~ GCCCCGGCCCTTCACCAGCCAGGCGATGAACTCCTTGGCGGC 
15 CTGGCC - 3' [SEQ ID NO: 37] 

The hybridization reaction was carried out as described 
in Example la. The hybridised product was ligated in place 
of Exendin-4 in the pu~B02~Exendin~4~Fc expression vector to 
20 create pOB02-Giy°~GLF~l~Fc. 

Exa mple 2 : Exp ression ->f j - :usi or> p> ri ^ ns 

Expression of the fusion proteins encoded by the DNA 
constructs of Example 1 was carried out by transiently 

25 trans fee ting HEK 293 SBHA cells {both adherent and 

suspension) . Cells were counted and seeded 24 hours prior 
to trans f ec tion . The trans feet ion cocktail was prepared by 
mixing FuGene ?H 6 transf ection reagent (Roche Molecular 
Biochemicals, catalog # 1S14443) with OptiMSM (Gibcc/BRL) 

30 and incubating at room temperature for 5 aria at which point 
BHA was added and the cocktail was incubated for an 
additional 15 rain. Immediately before transf action, fresh 
growth media was added to the plate. Tables 1 and 2 provide 
further transf ection details. 



35 
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Table 1: Reagents used in transient transfect :>ns of 2 931 
cells- 



tissue 


Number of 


DE5A 


FuGene 


OptiWEM 


Vol, of 


culture 


cells 


(m) 


{pi) 


media 


growth 


vessel 


seeded 


(ml) 


medium 
(ml) 


3 5 mm 


s x icT 5 "" 1 




9 


0.102 




100 ram 


2 X *0 " 


12 


7 3 


0.820 




700 cirs^ 


2 X 10 7 


65 


400 


4.0 


100 


(RB) 













5 Table 2: Media composition 



Growth and trans feet ion 
medium 


Harvesting medium 


MEM : I 


Kybritech base 


5 % PBS 


1 m& Ca 2 " 


20 vM HEPES 


20 irH HEPES 


2 mM L--glu ;:.a:ni ri~ 


1 yg/sO " - nsulin) 


50 fig /mi genet icin (6418 
MSG) 


1 fig /ml human transferrin 




50 ug / ml zr~i~ 



For small-scale transfeefcions (35i«rn ™ lOirou vessels), 
cells were rinsed with PBS and switched to harvesting media 

10 24 hours post- transfection and media was collected and 
replaced every 24 hours for several days. In the case of 
large-scale trans Sections {700 cm** roller bottles) , the 
roller bottles were rinsed with PBS 48 hours post- 
trans faction and changed to harvesting media. Media was 

15 collected and changed every 24 hours for at least 10 

consecutive days. Routinely , only 10 harvests were used for 
subsequent protein purification- 
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K-rrpls 3: Purification of heterologous fusion pr oteins 

5.x tiplc 3a purification of Val -GLP-l-Fc 

Approximately 4.5 liters of conditioned medium ( fusion 

protein expression level approximately 20 fig /ml) from large- 

5 scale trans feet ions was filtered using a CONO filter system 

and concentrated to 250 ml using a F ' £1 w 

S 

filtration system with a 10 K filter membrane. Val -GLP-l-Fc 
was captured with a 5 ml Hi Trap protein A column in Ix PBS, 
pH 7.4 at a flow rate of 2 ml/min and e luted with 50 mM 

10 citric acid pH 3,3. Fractions (1 ml) were collected in tubes 
containing 4 ml lx PBS and 100/il 1 M Tris pH 8 . 

F 3 2S co.t - (ii ! Tit- f pr< V mined 

by SDS-PAGE and reverse phase-HPLC on 2orbax C8, were pooled 
and applied to a Super dex 75 60/60 column in Ik PBS pH 7.4 

15 at a flow rate of 10 ml/mixi. Positive fractions (20 

mis /tube) were collected and pooled. Pooled fractions were 
then subjected to C4 Reverse Phase Chromatography in 0.1 
%TFA water at a flow rate of 3 ral/min. Val -GLP- l~B'c was 
eluted using a gradient from 5% B (0.1% TFA in acetonitrile) 

20 to 100% 3 in 70 min. Eluant fractions (3 mis/ tube} were 

collected. Acetonitrile was removed by vacuum drying and 1 
ml of HaO was added. The purified sample { approximately 32 
nils) was dislyzed twice against 4 liters of lx PBS pH7 . 4 . 

The dialyzed sample was then filtered using a MILLEX-GV 

25 0.22 urn Filter Unit and concentration was determined using 
absorption at 280 nm. 

Example 3b purification of Val --GLP-1--HSA or Val -GLP~1~ 
Linker-HSA 

30 Approximately 5.5 liters of conditioned medium (fusion 

protein expression level approximately 10,ttg/ml) was filtered 
using a CUNO filter system and concentrated to 330 mis using 
a ProFlux tangential flow filtration system with a 10 K 
filter membrane. 



WO 02/46227 



PCT/li SO J/43165 



- 74 - 

'The fusion protein was captured using a 50 mi Fast Flow 
Q column ( Pharmacia) in 20 xM Tris pH 7.4 at a flow rate of 
5ml /min. Protein was eluted using a gradient: from 0% to 50% 
20mK Tris pH 7.4, 1M NaCl in 10 CV, then to 100%B in 2 CV. 
5 Fractions containing the fusion protein were pooled and 

subjected r,o C4 Reverse Phase Chromatography in 0.1% TFh 
water at a flow rate of 5 ml/min. The fusion protein was 
eluted using a gradient from 20% 8 (0.1% TPA in 
acetonitrile) to 90% B in 120mi.n, Fractions (3.5 ml /tube) 
10 were collected. Acetonitrile was removed by vacuum drying. 

Approximately 9 mis of pooled sample was diluted with 
lx PBS pH 7.4 to 40ml and dialyzed against 4 liters of Ix 
PBS pH 7.4 overnight. The sample was filtered and 
concentration was determined by absorption at 2 8 Oram 

15 

Example 3c purification of Exendin-4-Fc: 

Approximately 4 liters of conditioned medium (fusion 
protein expression level approximately 8 /*g/ml) was filtered 
using a CIMO filter system and concentrated to 250 mis using 
20 a ProFlux tangential flow filtration system with a 3 OK 
f i 1 tsr raentbr an e . 

Fxendin-4-Fc -was captured with a 5 ml Hi Trap protein A 

elated with 50 mM citric acid pH 3.3. Fractious containing 
25 the fusion protein were pooled, filtered, and dialyzed 

against 4 liters of" 1 x PBS over night. The dialyzed sample 
was then applied to a Superdex 75 60/60 column in lx PBS 
pH7.4, 0.5M NaCl at a flow rate of 10 ml /min. Fractions (20 
ml /tube) containing the fusion protein were collected, 
30 pooled,, and concentrated to about 1 tag /ml . Concentrated 

samples were then filtered using a MILLEX-GV 0.22 urn Filter 
Unit. 
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Example 3d purification of Exendin~4-HSA and Exendin-4- 
i inker »HSA: 

Approximately 1.1 liters of conditioned medium {fusion 
5 protein expression level approximately 6j*g/ml) was filtered 
using a CUM) filter system and concentrated to 17 5 mis using 
a ProFlux tangential flow filtration system with a 3 OK 
filter membrane. 

The fusion protein was captured using a 5 ml HiTrap Q- 
10 sepharose column { Pharmacia) in 20 mM Tris pH 7.4 at a flow 
rate of 2 m 1 /rain , Protein was eluted us.i.ng a gradient from 
0% to 50% 20mM Tris pH 7.4, 1m NaCl in 12 CV and then to 
100%B in 4, CV. 

Fractions containing the fusion protein were pooled and 
15 subjected to C4 Reverse Phase Chromatography in 0.1% TFA 
water at a flow rate of 5 ml/rain. The fusion protein was 
eluted using a gradient from 10% B (0.1% TFA in 
acetonitrile) to 100% B in 70 miru Fractions {10 ml/ tube) 
containing the fusion protein were collected. Acetonitrile 
20 was removed using a vacuum dryer. 

Approximately 8 mis of pooled sample was dialyzed 
against 4 liters of Ik PBS pH 7.4 overnight. The sample was 
filtered and concentration was determined by absorption at 
280nm. The diaiysed sample was then applied to a Superdex 
25 200 2 8/60 column in Ix PBS pH 7.4, 0.5 M HaCl at a flow rate 
of 2 ral/min. Fractions (3 ml/ tube) containing the fusion 
protein were collected, pooled, concentrated, and filtered. 

Bxample 4: Char acterizatio n of fu sion protei_ns___by SDS _PRGE_^ 

30 SDS-PAGE followed by inimunoblotting was used to analyse 

both purified fusion protein as well as conditioned medium 
from ceils transfected with various fusion protein 
expression vectors. SDS-PAGE was performed on a Hovex 
Pcwerease 500 system using Kovex 16% Tris -Glycine Precast 
35 gels (EC6498) , running buffer flOx, LC2675) and sample 
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buffer {L2676} . Sajsples were reduced with 50 rnM DTT and 
heated 3-5 min at 95°C prior to loading. 

After running the SDS-PAGS gel, water and transfer 
buffer (IX Tris-Glycine Seprafouff (Owl Scientific Cat, Ho, 
5 ER26-S) with 20% methanol) were used to rinse SDS from the 
gels, A Novex transfer apparatus was used with. PVD? {BioRad, 
Cat. No. 162-0174) and nitrocellulose membranes {BioRad, 
Can. No. 17 03965 or 1703932). Transfer was carried out at 
room temperature for 9 0 rain at 3 0-3 5 V. Membranes were 

10 blocked in IX PBS with 0.1% Tween-20 {Sigma, Cat. Ho. p- 

7949) and 5% Milk (BioRad, Cat. No. 170-6404) for 1-12 hours 
at 4°C, Antibodies are diluted into IX PBS +5% Milk and the 
blots are incubated in these solutions for 1-2 h at 4°C. 
Between incubations, the blots are washed four times for 5 

15 min each with IX PBS and 0.2% Tween-20 at room temperature. 
PBS was made from either GIBCO 10X PBS {Cat No. 70011), to 
give a final composition of 1 mM monobasic potassium 
phosphate, 3 mid dibasic sodium phosphate, 153 mM sodium 
chloride, pH 7.4, or PBS pouches from Sigma (Cat. No. 1000- 

20 3), to give 120 mM NaCI, 2.7 mM KC1 and 10 mM phosphate, pH 
7.4 at 25°C. 

The primary antibodies were either a polycoional goat 
anti-IgGl or rabbit anti-HSA. The secondary antibody was 
either an anti-goat IgG HHP or an ant i -rabbit IgG HRP. The 
25 secondary antibody was diluted 1:5000. An ECI. system 

{Amersham Pharmacia Biotech, Cat. Mo. RN2108 and Cat. Ho. 
REH1S74H) was used for developing blots. 

Figure 3A compares purified Fc protein to conditioned 

8 

media from puBOO-aal --GLP-1--FC and pJB02-Kxendin-4-Fc 
30 trans fee ted cells. The decrease in mobility is consistent, 
ttfith the increased size due to the GLP-l portion of the 
fusion protein. Figure 3B similarly compares purified HSA 
with conditioned media from cells trans fee ted with pJB02~ 
Val -GLP-2-HSA, pJB02- Val 8 -GLP-l -Linker -HSA, pJB02-Exandin- 
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4-HSA, or pJ'B02-Exendin-4-Lirtker-HSA . Figure 4 identifies 
purified fusion protein preparations. 

Example 5: Characteri sa tion of fu si . on 
5 pro tein s using mass spect rometry : 

All experiments were perforated on a Micromass TofSpec- 
2H mass spectrometer equipped with Time Lag Focusing 
electronics, a Heflectron {used in analysis of the 0-8000 Da 
peptide range) , a Linear detector (used during high mass / 

10 good signal analysis) , and Post Acceleration Detector {or 
P.A.D. , used for high mass / extremely low signal analysis) 
The effective path length of the instrument in Linear mode 
is 1.2 meters, in Reflactron mode it is 2.3 meters. Two 
dual micro -channel plate detectors are fitted for linear and 

15 reflectron mode detection. The laser used is a Laser 

Science Inc. VSL~337i nitrogen laser operating at 337 nm at 
5 laser shots per second. All data were acquired using a 2 
GHz, 8 bit internal digitizer and up to 50 laser shots were 
averaged per spectrum. 

20 The instrument was operated in linear mode for the 

analysis of the GLP-1 fusion proteins in question. The 
linear detector is a device that detects ions that travel 
down the flight tube of the MAL>D I - T o F ~ MS instrument. It 
measures the ion abundance over time end sends a signal to 

25 the digitizer for conversion. The digitizer is an analog- 
to-digital converter that, allows the signal from the mass 
spectrometer to be transferred to the computer, where it is 
reconstructed into a usable m/z spectrum. 

A recrystallised saturated sinapinic acid solution 

30 (diluted in 50/50 Acn / h 2 0 and 0.1% TEA } was utilised as 

the ionization matrix. Sinapinic acid is a proper matrix' for 
proteins above 10 kDa . Mass appropriate reference proteins 
were used for internal and external calibration files in 
order to obtain accurate mass determinations for the samples 

35 analysed. Samples were all analyzed using a 1:2 sample to 
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matrix dilution. The instrument was initially set up under 
the following linear detector conditions : 

Source Voltage; 20.0 keV Pulse Voltage : 3.0 keV 

5 Extract ion Voltage: 20.0 keV Laser Coarse: 50 

Focus Voltage: 16.0 keV Laser Fine: 50 

Linear detector : 3.7 keV 
P.A..O.: (off line) 

10 These settings were modified (as needed) to give the 

best signal /noise ratio and highest resolution. Table 3 
provides a characterization of different GLP-1 fusion 
proteins. 

15 Table 3 



Fusion Protein 


Expected Mass 
{ KDa } 


Mass determined 
by Mass Spec 
{kDa} 


Val B -GLP-l-IgGl 


59.08 


Si. 94 


Val b ~Glu-^ -GLP-1 XgGl 


5.9.23 




Gly B -GLP-l~lgGl 


59.00 


62,93 


Val°-GLP-I-CSx-IgGl 


60.45 


65.1-65.6 


Vai y -Glu ii/ -GLP-l-CEX"IgGl 


60.69 


65.86 


Exendin-4-IgGl 


60.69 


65.86 


. ! i EF T - 




69.89, 70.74 


E.>;endin~4--HSA 


70.56 


70.62 


Exendin-- 4 - Linker-HSA 


71.56 


71.62 



CEx refers to a C» terminal extension and comprises the 
sequence of Ser~Ser~Gly-Ala~Pro-Pro~Pro~Ser . 
Linker is Gly-Qly-Gly-GIy-Ser-Gly-Gly-GXy-Oly-Ser-Gly- Gly- 
Gly-Gly-Ser . 



20 
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Examp le 6 : Activity of J ro teins:_ 
The ability of the fusion proteins of the present 
invention to activate the GLP-1 receptor was assessed using 
in vitro assays such as those described in EP 619,322 to 
5 Gelfand, et al . , and U.S. Patent Ho. 5,120,712, 

el . ] e acta itv of these compounds relative to 

8 

the activity of Val -GLP-1 { 7-37 ) OH as reported m Table 4, 
Figure 8 represents in vitro dose response curves for Val 8 - 
GLP-1 and Exendin-4 fusion proteins. In addition, Table 5a 
10 and 5b provide trie in vitro activity of a large group of 
GLP-1 analogs that can be f-ased to an Fc or an albumin 
protein to make biologically active fusion proteins. These 
activities are compared to GLF- 1(7-37} OK . 

15 Table 4; In vitro activity of GLP-1 fusion proteins 



Fusion Protein 


Za vifcro Activity 




of Val -GLP-1) 


Val GLE - IgGl 


1 


Exendin-4 -IgGl 


240 


Va 1 -GLP-1- Linker-KS A 


0.2 


Exendin-4-HSA 


20 


Exsndi n - 4 - Linker-HSA 


90 


Exendin-4 


500 


Va 1 b - (31 u * 1 - GL P - 1 - IgGl 


3.7 


GIv**-- GLP-1- IgGl 


3 . 3 


Val^-GLP-I-CEx-XgGl 




y 2 2 

IgGl 


29 


GLy* ■ ■ ^ i-C2»lgGi 


75 


Gly ' p-1-CEac-Linfcer-IgGl 


150 




250 




330 




4 


P 1 CEx LinJcui IqGJ 


so 
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CEx refers to a C- terminal extension and comprises the 

sequence of Ser-Ser Giy -Ala -Pro- Pro-Pro- Ser . 

Linker is Sly-Gly-Gly-Gly-Ser-Gly-Gly-Gly-Gly-Ser-Gly- Gly- 

Giy-Gly-Ser 

5 C2 is Ser-Ser-Gly-Ala-Ser-Ser-Gly-Ala , 

The amino acid sequences of the fusion proteins 

described in Tables 3 and 4 are represented in SEB ID NO: 13 

to SEQ ID HO: 31. 

Val 8 -GX.-P- 1 -Htnaan serum albumin amino acid sequence is 
10 represented toy SEQ ID NO: 13. 

I i v K ZZKF KMJ\ LIAS AO 

61 YisQQCPFSDH VKLVSJSVTBF ARTCVADESA BUCOKSLHTL FGDKIjCTvAT LRESJYGBMAD 
121 , I I I t '1 YEIARRHPYF 

181 YAPEU.FFAK RYKftAFTECC QAADXAACfcl; 9KM>ES»KDES KASSAKQRWK CASLQKFGER 

15 241 AFK&WAVARL SQRFPKABPA BVSKLVTOE^ KVHTBCCHSD I.t.ECADORAD rAKYXCENQD 
301 SISSKLKSCC III" _ r ^ ADLP3LAADF L T < 3 AEAKDVFI.GM 
361 FX.YBYARHKP OYSWI.LI.RL AK'ITS-TTLBK CCAAADPHEC YAKVFDEFKF LVEBPQMLIK 
421 QNCELFEQLS EVKFQNAU.V RYTKKVPQVS TPTLVBVSRJJ IsGKVG 8KC0K HPEAKRMFCA 
481 EDYLSWLHQ IiCVLHSKUBV SDRVTKCCrS SLVKRRPCPS ALEVTJBTYVt? KBFNASTSTF 

20 541 HADTCTLSEK ERQT.TvKQTfi.Tj VFTA/KBK.PKA DFAAFVEKCC KMDKSfCFA 

601 EEGKKLVAAS OAAK3L f SEQ ID HO: 13] 

Val 8 ~GLP~l-Linker-Hurctan serum alfoumia aroino acid secjuence is 
represented by SSQ ID NO: 14. 

25 1 HV3GTFTSDV SSYLEGQAAK SF1SSO.VKGR G6C566SGGGG SC-' > : t , - > 

61 GEEKFJKM.VL lAFAQYLQQC PFHDITl L T "J S . 'h YDFSAP.N T >LHTI rj 

121 CTVAT:1R5T- G gMADC CAKQ RRERliSCrLQ .HRDT^PifoPR LyRRBVDyyR; .JTAFHr^Bft^ 

lo, R -Wift, OA* »K AACL^jt- 

241 KQRLKCASTjQ KFGEHAFKAT 
30 301 DORAD sit :£HCDSISS1 i ::CEK.?I I '" T \EVF. NDBHPADIuPS T.AADFVSSKD 

3 61 VCKNYASAtU) VFI^MFIYSY ARRKPDYSW LLL.RLAKTYE TTIjEKC C AAA DPHECYAEVF 

421 prj n ' i o ut > i r iiatgivrytek vtqvstptev j 

- 1 ui H\f Ft^ ■ J' t v „ L x i - ^ 1 ,^ V- "I K 

3 5 601 VERCTKADDK ETC F A'SEGKK L\/AASQAAI,G 1, -SSQ ID t t 14 j 
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Gly 8 -Glu 22 -GLP-l-CEx-Linker-Human serum albumin amino acid 

sequence is represented by SEQ ID NO: 15. 

1 HGEGTFT SDV SSYLSEQAAK EFIAWLVKGR GSSGAPPPSG GGGGSGGGGS GGGGSBAHKS 
61 EVAKRFKDBG RSNFRAEVEI AFAQYLOQCP FSyi-iViSWE VTEPAKTCVA EESAEK<CDKS 
5 121 EHTEFGDKEC TVATLRETYG El.feDCCAK.QE PERBECFEQB KDDNPHI>PRI> VRPEVDVMCT 
181 AFHDN K FT FA KKY1.YJS.TARR HPYFYAFKLB FFAKRYFAAF T ECCQA ADR A ACRE PKADEL 
241 RDEGKASSAK QRLXCASLQK FGBRAFKKK& VAR1SQRFPK ASF AS VS KL V TOMTKVHTBC 
3 01 C HQDBLEC AD DRADEAKY I C ENQDSISSKL KSCCEKPLLE KSUCXAEvEK DKMPAOEPSE 
361 AA0FVJ3SKDV CKUYASAKDV FfcGMFLYEYA RRHPBYSWE LLRLAKTYE1 TIjEKCCAAAD 
10 421 PBEC YAKVF.D EFEFIATFRPQ MYlKQiSSCELR BQ'LGEYXBQE ALEVRYT&KV PQVSTPTEVE 
4.81 VSRHAGRVGS KCCKHPEAKR MRCABDYESV VX«Q&CVX,KE- KTPVSDRVTK CCTSSLVHRR. 
541 PCFSAliEVDR TYVPKF.FNAS TFTFKADICT LSEK.KEQIJCK QTALVBI.VK.H KPKATKEQLK 
601 AVMDDF&AFV EKCCKAODKE TCFAEEGKRIt VAASGAAEGE (SEQ ID NO: 15) 

15 Exendin - 4 - Human serum albumin amino acid sequence is 
represented by SEQ ID HO: 16. 

1 HGEGTFT SBIi SKQMBEEAVR BFI EVfEKNGG PSSG&PPPSD AHKSEVAHPF 
61 LVLIAFAQXX. OQCPFEDHVK LVNBVTEFAK TCVASSSAB3SF CTJKSLHTLFG DKLCTVATLR 
121 E'i'YGSMADCC AKQEPKRNBC FLQHKDDITPN LPRLVRFEVD VMCTAFHPHS ETFLKKYXiYS 
20 181 IARRHPYFYA PEEEFFAKRY KA&PTECCQA ADKAACLLPK LEEERDEGKA SSAKQRLKCA 
"-i q C>\,N,--*7 - " > - >-l> ^ 'WTAEV SF.r..VTDi,TKV HTKCCMGDEI. SCABDRADB& 
301 KYICEMQDSI SSKLKECCEK PMiEKSHCSA EVKNDEMPAD iPSLftAOFVE SKDVCKNYAE 
361 AKDVFEGMFL YBYARRKPDY SWtLjCRtAK TYSTTI.EKCC AAA0PHECYA JKVPDBPKPLV 
421 KKPQNtT : tn „ „ lALLVRY TKKVPQVSTP TLVSVSRBkG KVGSKCCKHP 

25 481 EAKRMPCAFD YASVVLiJQLC VLHEKTPVSD EVTRCC'TESE VRRRPCiASAA BVEF.TYVPKS 

x , v v t r~r - - n f < } r qikkqtatjVE um kat 31 ^ddf r? 

601 DDKETCFAEE GKKLVA&SQA ALGL [SBQ ID NO: 16] 

Exendin- 4~ Linker -Suraan. serum albumin amino acid sequence is 
30 represented by SEQ ID NO: 17. 

1 HGEGTFTSDXi SKQMBEEAVR LPlffi&RKGG PSSGAPPPSG GGGGSGGGGS GGGGSDAHKS 
SI EVAHRFKDLG EEKFKAEVLX AFAQYLQQCP PSDHVKLVNE VTEPAKTCVA DESASNCDKS 
121 LHTLFGDRLC TVATERETYG EMADCCAKQE PERNBCFLQH KDDHBBEPRE VRf'EVDVMCT 
1 F 1 F b I F i KA<. Ki AM J > i 

35 241 RDBGKASSAK QRliKCASLQK FGERAFKAVSA VARLSQRFPK AEFAEVSKLV TBKTRVHTBC 
301 CRGBLliECAB DRADI: AK.Y I C SNQDS J.SSSl, RECCFKPELE RSKCIABVEK D ERP ADT.-P S A 
3 61 DF vf. a AKDV Pi 1 1 RRKPDYSWL U AAKTYST YEEKCCAAAD 

421 PHBCYAKW3> BFKPIjVEEPQ NLIKQNCEEF EQLGEYKFOK AilA/RYTKRV PQVSTPTLVE 
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481 VS3KLGKVGS KCCKHPEAKR MPG&EDYLSV VLNQLCVLHB KTPVSDRVTK CCT3SLVNRR 
541 PCFSALSVDE TYVPKEFNAE TFTFH&DICT LSEKSROKKK QFALvELVKB KPKATKBQL-K 
601 AVMDDFAAFV SKCCKABDKS TCFAEEGKKL VAASQA&LQL [SKQ ID NO; 17] 

5 Val 8 --GLP™ 1 ••• IgGl amino acid sequences represented by SBQ ID 

HQ: 18. 

1 KVEGTFTSDV SSYLSGQA&K EFIA8LVKGS GAEFKSCDKT HTCPPCP&FE BLGGPSVFU? 
61 PPKPKDTLM1 SRTPEVTCW VDVSHEDPBV KFW5fVDOVB VHK&KTKPKB BQYHSTYRW 
121 SVLTV1.HQW LMGKEYKCK.V SNKALPAPIE K:T I S KAKGQP REPQVYTLPP SPJSENTKMQv 
10 181 SI.TCLVKGFY PSDIAVEWKS NGQ P EEJivYKT TPPVLDSDGS FFLYSKLTVi) KSi*?CQGSiVF 
241 SCSVMHBMjH HHYT'QKSLSIj SPGK {SBQ ID NO ; 18] 

Val 8 -GLP- 1 -Cax~ IgGl amino acid sequence is represented by 
SEQ ID HO; 19 . 

15 > ,> >KSC»KTHT CPl?CPAPi£i>I> 

51 GGPSYFFFPP KPKDTLMISR TPPYGYF/VY.O VSRBBPBvKF WArDGVEVH WJUCTXPKBSQ 
121 YNSTYRWSV &TVLHQEMLK GKEYKCKVSN KALPAPXEK2? ISKRKGQPRB PQVm»PPSR 
181 RBMTS^QVSL TC.LVK.GFYPS DXAV8WJ8SM3 QPESSKfYKl'TP fWDSDGSPF LYSKLTVBRS 
241 RWQQGNVFSC SV5KEALHHK Y^QKSLSLSP GK [SBQ IS NO: 1.9] 

20 

Val 8 ~GltT 2 GLP-l- IgGl amino acid sequence is represented toy 
SEQ ID HO: 20. 

1 " T T I „ix . "P/fCDKT HTCPPCP&PE 1/L-GG PSVFLF 

61 PPKPKDTLMI SRTPEVTCW VDVSHEDPEV KFNWYVEK3V2B VHKMOHCPRE EQXMSTYRW 
25 121 SVT,WX,KQtW JjNGKBYKCKV SNKALPAPIE TT KAI QP REPQ\T$ PP SH 'LIa 
181 SITCLrVKGPY FSDIAVE5?BS NGQPBMWYKT TPPVT.DSDGS FPIiYSKtiTVD KSRWQQGNVP 
241 SCSVMHEALH HTSYTQKJ5LSL SPGK f SEQ ID NO; 201 

Val a -Glu 2 **GLP- 1-CEx-IgGl amino acid sequence is represented 
30 by SEQ ID HO; 21. 

61 GGPSVFLFPP KPKDTU5ISR TPEVTCVWD 7SHB0PF.VT.F l^WYVOGVBVH I«JAKTKPREEQ 

1^1 v j\J37£ u x . vA^f ^r.i J t )1 ; 

181 EEMTKKQvSL TC1.VKGFYPS • DIAVBWESHS QP3M-3YKTTP FVItDSDGS FF LYSK1TVDKS 

3 5 241 RVIQQGOTFSC SVMKEALHNH YTQKSLSLSP GK f SEQ ID 110; 21] 
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Gly 8 ~Glu" 2 GLP~l-C2~IgGl amino acid sequence is represented 

by SEQ ID NO: 22. 

1 HGHGTFTSDV SSYLEEQAAK EFXSSSLVKGR GSSGASSGAA EPKSCDKTHT CPPCPAPELL 
61 GG'PSVPLFPP KPK.DTUlJ.SR TPBVTCVWD V8HSDPEVRF KifYVI'GVEV f iI RG\RTRPRBBQ 
5 „ _ > '\ 1 ^ ' t_ j t ^ -T - , x t OVVn.PPSR 

f SKLTVDKS 
241 l t i S f H 1 i VQ'f SRSP GR [SRQ ID t ^1 

3 22 

Gly -Glu GXjP-1-CEx-L inker- IgGl ammo acid sequence is 
10 represented by SSQ ID HO; 23. 

I iTF'J I f 1 f I/VPGR GSSGA GGGSQGGG SQSAEPJt 

61 DKTHTGPPCP APELLGGPSV PLPPPKPK.DT LMXSRTFSVT W^VSKSD PBVK^WYVD 
121 GVEVHNAKTK PREEO.YKSTY RWSVIiTVLH Q0WLKGKEYX CXVSNKSLPA P.I8KTISKAK 

i8i -rr , , j i ji, , j i rrPPVLDS 

15 i i i i ^^Ob l x<- - s ^ J j > 1 

8 22 

Gly -Glu GLP-l-CEx-Linker~igG4 amino acid sequence is 

represented by SSQ ID NO: 24. 

1 HGEGTFTSDV SSYLSSOAAK BFIA«X.VKGR GSSGAPPPSG GGGSGGGGSG GGGSAESKXG 
20 61 PPCPSCPAPE FLG0PSVFLF PPKFKDTLM1 SRTPEVTCW VDVSQSDPW QFSJKVDGVB 
j. _ _ h <• )i i _ 11 1 I t L x ^ e Jlv 

181 B.EPQVY7L PP SQEEMTKWQV SI>TC£.VKGFY PSDXAVEWSS KGQ?E>WYKT TPPTODSBGS 
241 FFLYSRLTVD KSRWQEG2WF SCSVKHEALH HHRtTQKSLSL SLGK [SSQ ID EG: 24] 

8 22 

25 Gly -Glu GLP-l-CEx~2 Linker ••• IgGa amino acid sequence is 

represented .by SEQ ID HO: 25. 

3S ) -TSr SSYLEKQAAR EFIMvLv'KGR GSSGA? PS ;G GGGSGGGGSG 

SI GGGSGGGGSA EPKSCDKTHT CPPCPAFBBL GGPSVFLFPP KPKGTGKJSR TPBVECWVD 
121 1 I S " i \"" BVH NAKTRPRESQ OiSTYRWSV LTVLBQDStfliN GKBYKCKVSN 
30 181 KAL PA P I BK'T ISKAKGQPHB PQVY r i'i,P?SR BMKKBQVSL TCLVKGFVPS DIAVSS9ESNG 
241 QPENNYKTTP SVLDSOGSFP LYSKBTVDKS RWQQGS3WSC SWOiEMAUH YTOKSIiSIiSP 
301 GK (SEQ 2D NO: 25] 

Gly 8 -Glu 22 GLP-l-2Linker-IgGl amino acid sequence is 
35 represented hy SEQ ID HO: 26. 
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I BGEGTFTSDV SSVLESQA&K 3FXAWLVXGR GGGGGSGGGG SGGGGSGGGG SGGGGSGGGG 
61 EPKSC T HTC _P X2 L JPSVFLP PPKPKDT1.MX SRTPSVPCW VDVSHEDPKV 
?l Kill,, Jl,a 4 <5 -v S i t 1 

i&i t i.;.«v< £ prrc ~"tif: ^ekf^ -h^y - 

5 241 TPPVt;DSDGS FFTA'SKDTVD KSEWQQGNVF SCSVMHEALH fj'HYTQKSLSL SPGK 
[SSQ ID NO: 26] 

Gly 8 ~G1u' m2 GLP- 1 - 2CEx- IgGl amino acid sequence is represented 

by SSQ I'D WO: 27. 
10 l ff < Y f SD" '"V VK ?F3 ? GR GSSGAPPPSS SGAPPPSAEP KSCDRTBTC? 

61 PCPAPELLQG PSVFLFPPK? F~ ~ , 1 - ' - ( , T irn--" i rr i F^nNix 
121 KTKPRSSQYN STYRVV'SVLT VLHQD^HGK EYKCKVSKKA LPAPIEKTI8 KAKGQPREPQ 
181 VY?LPPSRE£ MTKNQVSLTC LVKGFYPSDX AVEWBSHOQP ENNYKTTPFV LDSOGSFFI.Y 
241 SKLWDKSRW QQGNVPSCSV SiHS&bHHHYT QKS^SDSPGK [SEQ 2D HO: 27] 

15 

Gly 8 -Glu 22 -Val' 1 5 -1 1 e 3 3 6LP- 1 -CEx~Linker~IgGl amino acid 
sequence is represented by SEQ IB NO: 28. 

1 HGEOTFTS0V SSYl,SSQA\r< SFIAWMKGR GSSGAPPPSG GGGSGGGGSG GGGSAEPKSC 
61 DKTHTCPPCF APELLGGPSV FLFFPKFKD'r LMISRTPSVT CVWWSHKD PKVKF«W5fVD 

2 0 121 GVEVHNAKTK PREEQYNSTY RWSVLTVLH QDWl^GKSYK CKVSNK&LPA PX8KTISKAK 

181 GQPREPQVYT LP P SRE EMT R NQVSLTCLVK GFYPSDIAVE WESHGQPEMN YKTTPFVLDS 
241 DGSFFIiYSKL TVDKSRWOQG NVF3CSV33H3 AJuHSHYTQKS DSLSPGK [SEQ ID MO: 28] 

Exendxn~4-jgGl amino acid sequence is represented by SEQ ID 
25 NO: 29, 

1 HGEGTFTSDL. SKQKSBSAVR I..-F XEWLKSfGG PSSGAPPPSA SPKS CDKTHT CPPCPAPSLL 
61 GGPSVFLFPP KPKDTLMISR TPSVTCWV0 VSHEDP fc, V K F NWYVDGVEVH >!AF TRP RKEQ 
121 YNSTYRWSV LTVLMQDWLM GKEYKCJRVSM KAJLP AP I EKT ISKAKGOPEF. PQVTTLPPSR 
181 EKMTKNQVSI. TCLVKGFYPS DIAVSwESKG QPBMHYKTTP PVLDSDGSFF XjYSKLTVDKS 

3 0 2-" RWQQGNVFSt SVT EMiITKfR yTQ'i* LSLSPGK fSS ID > 
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Ln- - £y(3 1 ^fu \' -rid ^t{t ^nrf i 5 epresenfced by SEQ 
ID NO: 30. 

1 KGEGTFTSDL SKQMHEEAVR LFXEMLKNGG PSSGASSGAA EPKSCDKTHT CPPCPAPSLL 
3 61 GQPSVFI.PPP K PKDTIiMT SH TPSVTCWVD VSHEDPSVKF MRVDGVBVH NARTKPS.EEQ 
1 KCKVSN KALPAPISXT ISKAKGQPRE PQVYTLPPSR 

lei t-n^sNun t< ^ - - " < v r i - 1 r r iiyr i/^ji'/Tv 

241 RWQQGSVFSC SVHHSALHMK YTQKSi.-Sl.SP GK. j. SEQ ID NO: 30] 

10 Kxendin--4- Linker -IgGl araino acid sequence is represent by 
SEQ ID NO: 31. 

\ iGTT CbVh U i, - 1 ' IOC PSSGAPPPSG GGGSGGGCSG GGGSAEPKSC 

61 D'KTHTCPPCP APELLGGPSV FI.FPPKPKOT IMXSRTPEVT CVWOVSHED PBVKFHWTXVD 
121 GVEVHNAKTK PREEQYKSTY RWSVLWLH QDWLNGKEXK CKVSNKALPA PXEKTISK&K 

15 181 GQPRBPQWT I.PP5 >i ^ f Ov'SLTCLVK GFYPSDXAVF vKl' \U VXXTPPVItDS 

241 DGSFFLYSKL TVDKSRWQQG NVFSCSVMKE M^SJHTTQKS LSLSPGK {SEQ ID HO: 31) 
Table 5a: In vitro GLP-1 analog activity 

GLP-1 Keceptor 
GLP-1 Compound Activation 

20 

GLP-1 (7-37) OH 1.0 

Val 8 -GLP-1 {7-37} OH 0.47 

(n - 6} 

25 

Gly s -Hi s ll -GLP-X (7-37 ) OH 0.282 

Val* -Ala 11 -GLP-1 (7-37 }OH 0.G21 

30 Val 8 -LYS U ~GLP-l(7~37}OH 0.001 

Val 8 -Tyr 12 -GLP-1 {7-37 } OH 0 - 81 

Val'-Glu" •••GLP-1 {7-37 }OH 0 . Q47 
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Val 8 -Ala 16 -GLP- 1(7-37 ) OH 0.112 

Val 8 -Tyr lg -<3LP-1 (7-37)OH 1 . 175 

Val s ~Lys S0 --GLP-l{7-37)OH 0.33 

Gln 22 -GLP- 1(7-37} OH 0.42 

Val 8 -Ala a2 -GfLP- 1 (7-37 5 OH 0.56 

Val s -Ser 32 -GLP-I (7-37 JOH 0.50 

Val 8 -Asp 22 -GLP~1 (7-37) OH 0 . 40 

VaI s -Glu 22 -GLP~ 1 (7-37 } OH 1.29 

Val 8 -Lys 2a -GLP-1 (7-37) OH 0 . 53 

VaI 8 -Pro 2a -GLP-l (7-37 } OH 0 . 01 

VaI 8 -His 22 -GLP~l{7~37}OE 0.14 

Val s -Lys a2 -GLP-1 (7-36}M5 2 0 . 53 

Val s -Glu 22 -GLP-1 (7-36)NH 2 1.0 

Gly e -Glu 2a -GLP- 1 (7-37) OH 1.07 

Val 8 -Lys 23 -6LP- 1 (7-37) OH 0 . 18 

Val*-His 24 -GLP-1 (7-37} OH 0 . 007 
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Val s -Lys 24 -GLP-1 (7-37 5 OH 0 . 02 

Val 8 -His 25 -GliP-l {7-37 ) OH 1,6 

Val 8 -Glu 26 -GLP- 1(7-37} OH 1.5 

Val 8 ~Hls 27 -GLP- 1 ( 7 -37 } OH 0.37 

Va l s -Al a 27 -GLP- 1 { 7 -37 } OH 0.47 

Gly e -Glu 3C -GLP-i (7-37) OH 0.29 

Va 1 6 --Gin 30 -GLP- 1 ( 7 - 3 7 } OH 0.29 

Val 8 -Asp 30 -GLP-1 (7-37 } OH 0 . 15 

Val iJ ~Ser 30 -GLP-l ( 7-37) OH 0 . 19 

Val s ~His 3D ~-GLP -1(7-37} OH 0.19 

Val 8 -Glu 33 -GLP- 1(7-37} OH 0 . 039 

Val 8 -Ala 33 -GLP-ia-37)OH 0.1 

Val 8 -Gly 33 -GLP-1 {7-37} OH 0. 01 

Vai 8 -Glu 3 --GLF-1 (7-3730H 0.17 

^/a.I B -P.ro 35 -GLP-l{7-37}OH 0 . 094 

Val 8 -His 3S -GLP-l {7-37} OH 0.41 

Val 8 -Glu 3S -GLP- 1(7-37} OH 0.15 
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8S 



Val 8 ~Glu 3S ~GLP~l (7-37} OH 



0 



II 



Val 8 ~His 3S ~GLP~l < 7-37) OH 



0 



22 



Val 8 -His 37 -GLP~1 {7-37} OH 



0 



Val 8 -Leu i6 ~Glu 26 -GLP-l (7-37 } OH 0 .23 

Val 8 -Lys M -Qlii 30 -GLP- 1(7-37) OH 0.37 

Val 8 -Lys 22 -Glu 23 -GLP-1 ( 7 -37} OH 0.35 

Val 8 -Glu 22 -Ala 27 -GLP-1 (7-37 } OH i . 02 

Val 3 -GIu 22 -Lys 23 ~GLP-X (7-37 } OH 1 . 43 

Val 8 ~Lys 33 ~Val 34 -GLP-l (7-37)OH 0 . 08 

VaI s -Lys 33 -AsiV 4 -GLP-l {7-37 ) OH 0 . 09 

Val s -Gly 34 ~Lys 3S ~GLP-l (7-37 ) OH 0 . 34 

Val'-Gly 36 - Pro 3 7 -GLP- 1 ( 7 - 37 } HH 3 0.53 
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Table 5b. In vitro GLP-1 analog activity 



GLP-1 Compound 


(jTjP-i Receptor 
z\ a ~ ion 


GLP-1 {7-3?} OH 


1 . 0 


V'al b - GI.P -1(7-37} OH 


0 „ 47 


Gly ° - GLP- 1 { 7 - 3 7 } OH 


0.80 


Val Tyr 1 1 7)0} 


0 . SO 


Val°~ Tvr"-GL? -1(7-35; NH> 


0 . 52 


Vcti^-Trp^-GLP-l ( 7-37 ) OH 


0 . 52 


Va'.!*-Le-a :t; --GL?~l (7-37) OH 


0 - 52 


Val^-Val i0 -GLP--I {7-37 } OH 


0. 52 


Val d -Tyr ib -GLP-1 (7-37 ) OH 


X . ;i 3 


Gly '•' - Glu^-GLP ~I {7-37} OH 


1 . 03 


Val^-Leu^-GLP-l {7-37 } OH 


0.24 


Val^-Tyr^-Tyr^-GLP-i {7-37 } OH 


0.70 


Val^-Trp^-Glu^-GLP-l {7-37 ) OH 


0.80 


I ii Tv -7i -,1 -„ ( 7-37 ) OH 


1 . 27 


| y a 1 S -Tv r s ; 




1 a- "O r ~ ; ^ -i - (7 -37 ) OH 


1.69, 1.79 


I Val^-Txp 1 ' a 1 - '' 37) OH 


2 .30, 2.16 


Val 13 '~Leu v '-Gln" * -GLP-1 (7-3 7 ) OH 


2.02 


Pr/~7P~ -7TT3" gTx)- : ~~ [T-IitToh 




h< j 7 ! OH 


1 „ OS 


3 7 ! OH 


1.50, 3.10 


Val^-Ty£ xs -Glu ILL-l ?-37 OH 


2.40, 2.77 


i Val^-Phe^-Gl'u^-GLP-l (7-37) OH 


0.94 


| Val"- Ile l0 -Glu^-GLP-1 {7-3 7} OH 


1 . 38 


| Val*-Lys ii5 GUO< COO? 1(7 3 '■ 1 OH 


1.18 


t 1 : -t; -1(7-3 7) OH 


1.50 


Va V- Phe 1M - Gltf* '* -GLP - 1 { 7 - 3 7 } OH 


0.70 


Val " - Phe^'-Glu^-GLP- 1 { 7 - 3 7 ) OH 


1.27 
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LP-l(7-37)OH 


1.32 


Val*-Glu^~Ile'=~GLP-l {7-37} OH 


1.46 


Val* •••Glu" - Val" -GLP- 1 {7-37} OH 


2.21, 1,36 


Val^-Glu"' ■ - 1 le^-GLF-l (7-37 ) OH 


0.94 


T -1 Ola , a '- il -1(7-37) OK 


1.03 


Val e -Glu' i * - Ile JJ -GLP~l (7-37) OH 


2.21, 1.79, 1.60 


GLP-1 (7- 37} OH 




V« 1 -■'fN r v- 1 0 - T'r-n J "' } -0 : i it'-'~c : t,i;. w -; f 7„ 

vw^L -tjf J. ^-^ir ~cr.LU — \ / — 

37) OH 




Va I :: ' - Tro in - 1 ' u' ; '-Val" ! - 1 1 e ' J J - 
GLP-1 (7-37 ) OH 


2.35 


37 ) OH 




Val^-Glu" -Va3/ a ~ 1 le- 5 '' -GLP-1 { 7 - 
37} OH 


2.30, 2.73, 3.15 


Vfcl 8 -Trp"-Qlu"-Val'*-GLP-1 { 7- 
37} OH 


2 .07 


Val^-Cys^-Lys^-GLP-l {7-37 } OH 


1.97 


Val^Cys^-Lys^-Arg^-GLP-l (7- 
37) OH 


2.4,1.9 



Table 6: In vitro activity of GLP/Exendin analogs 



Peptide Sequence 


In Vitro 
Activity 
(% of Vai 8 -GLP- 
l(7-37)OH> 


HG EG TFT S D E S KQME S E A V R L F I £ GGEGGG G E - G H 2 


6.21 


HGEGTFTSDI SKQMEEEAVRIiF IE', LKNGGPSSG&PF 
PS-NH2 


6.75,3.25 


irEEGTFTSDLSKGFiEEEAvRGFrGWEVKGRG 


2.8 


HVEGTFTSDVSSYLEEEAVRLFiGWLVKGEG 


1 . 47 


g-'-ge ■ : " e.g. g: g ; •? • - 


0 . 11 
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WHGTFTSDVSKQMEGQAMCEFXAWLVKGRG 


0.04 


HGEGTFTSDI - EFIHC.KNGGP-NH2 


1-44 


iGEEGTFTSDLSKQM£EEAAEEFIEtEEaG2GP~NK^ 


2.80 


HGBGTFTSDVS S YLEEEAVRLF I EWI»KMOGP-NH2 


5.40 


HGEGTFT5DI,S3YLEEEAvKLP:3:EWLKNGGP-:NH2 


5.07 


HAEGTFTSDVSSYIjEGQAAKEFIAWLVKGRPSSGAPPPS 

-NH2 


3.30 


n^.^-r , "EE El I Y'JL VKGKG 


2.15 


m'EGTFTSDVSSYLEGQAAKEFlEX\!LKKGGP--HH2 


2.36 


HGEGTFTSDLSKC i : ' '.'LF I AWLVKGRG 


■■ .25 


HVBG'I EI 3 r/LEEEAAKEFIAWLVKGRG 


1.00 


HEE' TP EEC QEi 321 ZE LKHGP 




HVEGTF r 3Q ^AKEFIAWEVKGRG 


1.00 


HAEGTFTSDV 3 EE.EKEF I AWE' 


2.12 



Example 7: In viv o ph a rmacokinetics of Val -GLP-l-IgGl 

and yal 8 ■■ GLP - 1 ■■ HSA ; 

A pharmacokinetic study of V a 1 8 - GLP- 1- XgG 1 and Val**~ 

5 GLP-1-HSA was performed in cynoraolv g j cnonkeys 1 

8 

were dosed at 5.6 raaoles/kg with, either purified val -GLP-1- 
IgGL or Val -GLP- 1 -HSA . The compounds were administered as 
an intravenous bolus administration . Blood was collected 
pre-dose and at 0.083, 0.25, 0.5, 1, 4, S, 12, 24, 48, 72, 

10 96, 120, 144, 168, and 216 hours post-dose into tubes 

containing EDTA. Plasma concentrations of iromunoreac t ive. 
Val -GLP-l were determined using a radio immunoas s ay that 
utilises a polyclonal antiserum whose primary specificity is 
for the ®- terminal (7-16) region of Val 8 ~GLP»l (7-37) . 

15 Figure 9 depicts the plasma concentration of Val"-GLP-l~Fc 
and Val 8 - GLP- 1 -Linker -HSA following a single intravenous 
dose to two cynomoiogus monkeys . The Fc fusion protein had 
a half-life of approximately 45 hours and the albumin fusion 
had a half -life of approximately 87 hours. 



20 
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Two chronically cannuiated normal male beagle dogs were 
studied after an. overnight fast. Arterial and venous 
vascular access ports were accessed, and a catheter was 
5 inserted percutaneous ly into a cephalic vein and secured. 
Animals were placed in cages, and their catheters were 
attached to a swivel / fcefcher system, A solution containing 
the fusion protein Exendin-4-IgGl (11.8 \M) was injected 
intravenously {.1.0 nmol/kg; through the cephalic vein 

10 catheter. The catheter was then cleared with 10 ml of 

saline. Two hours later, a hyperglycemic (150 mg/dl) clamp 
was initiated and continued for three "hours. Arterial blood 
samples were drawn throughout this 5-hour period for 
determination of plasma concentrations of the fusion 

15 protein, glucose, and insulin. 

The results of this study were compared to those from a 
similar, previous study in which, both of the animals had 
received a bolus of saline, s.c, and three hours later were 
studied using a 3 -hour hyperglycemic {ISO mg/dl) clamp. 

20 In both sets of studies, plasma glucose concentrations 

were determined using a Beckman glucose analyzer. Plasma 
insulin concentrations were determined by employees of Lineo 
Research, Inc. using an RXA kit developed in their 
laboratories. The data is illustrated in Figures 10 and 11 . 

25 

Example 9 : In i\ 5 pin i i ie ihrrl-irt 
Gly a ~Glu 23 -GLP-l-CSx-Llnker-IgGl ; 
Two groups of three normal male beagle cogs received 
0.1 mg/kg of Giy 8 -G.lu 22 -GLP~l-CEX"Linker~IgC5i by subcutaneous 
30 {SO or intravenous (IV) administration. Plasma 

itrmunereactivity were determined by radioimmunoassay in 
samples collected from 30 minutes predose to 216 hours 
postdose for both the rv and SC groups . These 
35 concentrations were subsequently used to determine the 
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reported pharmacokinetic parameters. The mean elimination 
half-life of IV administered Sly 8 -Glu 22 -GLP-l-CSx-Linker-IgGl 
was approximately 55 hours and the total body clearance was 
1.5 mlj/h/kg. The mean elimination half -life of SC 
administered Gly s -Giu 32 -GLP-l-CEx~Linker-IgGl was 
approximately 3 8 hours. 
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WE CLAIM: 

1. A heterologous fusion protein comprising a first 
5 polypeptide with a N-- terminus and a C~ terminus fused to a 
second polypeptide with, a N- terminus and a C- terminus 
wherein the first polypeptide is a GLP-1 compound and the 
second polypeptide is selected from the group consisting of 

a} human albumin; 
10 b) human albumin analogs; and 

e) fragments of human albumin, 
and wherein the C- terminus of the first polypeptide is fused 
to the N- terminus of the second polypeptide. 

15 2. A heterologous fusion protein comprising a first 

polypeptide with a N- terminus and a C- terminus fused to a 
second polypeptide with a N- terminus arid a C- terminus 
wherein the first polypeptide is a GLP-1 compound and the 
second polypeptide is selected from the group consisting of 

20 a) human albumin; 

b} human albumin analogs; and 
c) fragments of human albumin, 
and wherein the C- terminus of the first polypeptide is fused 
to the terminus of the second polypeptide vies, a peptide 

25 linker, 

3 . The heterologous fusion protein of the Claim 2 wherein 
the peptide linker is selected from the group consisting of: 
a) a glycine rich peptide; 
30 b) a peptide having the sequence f Gly~Gly-Gly-Gly~Ser ] „ 

where n is 1, 2, 3, 4, 5 or S; and 
c) a peptide having the sequence £ Gly-Gly-Gly-Gly ~ 
Serh. 
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4 . The heterologous fusion protein of Claims 1, 2, or 3 
wherein the GLP-1 compound comprises the sequence of formula 
1 [SEQ ID HO: 2] 



5 



7 


8 


9 


10 


11 


12 


13 


14 


15 


16 


17 


His 


-Xaa 


-Xaa 


-Gly 


-Xaa 


-Phe 


-Thr 


-xaa 


-Asp 


-Xaa 


-Xaa- 


18 


19 


20 


21 


22 


23 


24 


25 


26 


27 


28 


Xaa 


-Xaa- 


-Xaa 


■Xaa 


-Xaa 


-Xaa 


-Xaa 


-Xaa 


-Xaa- 


-Xaa 


-Bhe- 


29 


30 


31 


32 


33 


34 


35 


36 


37 


38 


39 


lie 


Xaa 


Xaa 


-Xaa 


•Xaa 


-Xaa- 


-Xaa- 


-Xaa 


-Xaa 


■Xaa 


■Xaa- 


40 


41 


42 


43 


44 


45 












Xaa 


-Xaa- 


-Xaa 


-Xaa 


-Xaa 


-xaa 













Formula I (SEQ ID NO: 2} 



15 wherein: 

Xaa at position 8 is Ala, Gly, Ser, Thr, Leu, lie, Val, 

Glu, Asp, or Lys; 
Xaa at position 9 is Glu, Asp, or Lys; 
Xaa at position 11 is Thr, Ala, Gly, Ser, Leu, lie, 
20 Val, Glu, Asp, or Lys; 

Xaa at position 14 is Ser, Ala, Gly, Thr, Lea, lie, 

Val, Glu, Asp, or Lys; 
Xaa at position 16 is Val, Ala, Gly, Ser, Thr, Leu, 
lie, Tyr, Glu, Asp, Trp, or Lys; 
25 Xaa at position 17 is Ser, Ala, Gly, Thr, Leu, lie, 

Val, Glu, Asp, or Lys; 
Xaa at position 18 is Ser, Ala, Gly, Thr, Leu, lie, 

Val, Glu, Asp, Trp, Tyr, or Lys; 
Xaa at position 19 is Tyr, Phe, Trp, Glu, Asp, Gin, or 
30 Lys; 

Xaa at position 20 is Leu, Ala, Gly, Ser, Thr, lie, 

Val, Glu, Asp, Met, Trp, Tyr, or Lys; 
Xaa at position 21 is Glu, Asp, or Lys; 
Xaa at position 22 is Gly, Ala, Ser, Thr, Leu, lie, 
35 Val, Glu, Asp, or Lys; 
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Xaa at position 23 is Gin, Asn, Arg, GXu, Asp, or Lys; 
Xaa at position 24 is Ala, Gly, Ser, Thr, Leu, lie, 

Val, Arg, Giu, Asp, or Lys; 
Xaa at position 25 is Ala, Gly, Ser, Thr, Leu, lie, 

Val , Glu, Asp, or Lys; 
Xaa at position 26 is Lys, Arg, Gin, Glu, Asp, or His; 
Xaa at position 27 is Leu, Glu, Asp, or Lys; 
Xaa at position 30 is Ala, Gly, Ser, Thr, Leu, lie, 

Val, Glu, Asp, or Lys; 
Xaa at position 31 is Trp, Phe, Tyr, Glu, Asp, or Lys; 
Xaa at position 32 is Leu, Gly, Ala, Ser, Thr, lie, 

Val, Glu, Asp, or Lys; 
Xaa at position 33 is Val, Gly, Ala, Ser, Thr, Leu, 

lie, Glu, Asp, or Lys; 
Xaa at position 34 is Asn, Lys, Arg, Glu, Asp, or His; 
Xaa at position 35 is Gly, Ala, Ser, Thr, Leu, lie, 

Val, Glu, Asp, or Lys; 
Xaa at position 36 is Gly, Arg, Lys, Glu, Asp, or His; 
Xaa at position 37 is Pro, Gly, Ala, Ser, Thr, Leu, 

He, Val, Glu, Asp, or Lys, or is deleted; 
Xaa at position 38 is Ser, Arg, Lys, Glu, Asp, or His, 

or is deleted; 
Xaa. at position 39 is Ser, Arg, Lys, Glu, Asp, or His, 

or is deleted; 
Xaa at position 40 is Gly, Asp, Glu, or Lys, or is 

deleted; 

Xaa at position 41 is Ala, Phe, Trp, Tyr, Glu, Asp, or 

Lys, or is deleted; 
Xaa at position 42 is Ser, Pro, Lys, Glu, or Asp, or is 

deleted; 

Xaa at position 43 is Ser, Pro, Glu, Asp, or Lys, or is 
deleted; 

Xaa at position 44 is Gly, Pro, Glu, Asp, or Lys, or is 
deleted; and 
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Xaa at position 45 is Ala, Sar, Val, Glu, Asp, or Lys, 

or is 

deleted; 

provided that when the amino acid at position 37, 38, 39, 
5 40., 41, 42, 43, or 44 is deleted, then, each amino acid 
downstream of that amino acid is also deleted. 

5. The heterologous fusion protein of Claims 1, 2, or 3 
wherein the GLP-1 compotmd comprises the sequence of formula 
10 II {SEQ ID NO; 3) : 

7 8 9 10 11 12 13 14 15 16 17 
Xaa-Xaa-Xaa-Gly~Xaa--Xaa-Thr~Ser~Asp-Xaa-Ser- 
18 19 20 21 22 23 24 25 26 27 28 
15 Xaa~Tyr~Leu--GIu--Xaa~Xaa~Xaa-Ala-Xaa~Xaa-Phe~ 
29 30 31 32 3 3 34 35 36 37 
Ile-Xaa-Xaa--LeU"Xaa-Xaa-Xaa-Xaa-Xaa 
Formula II {SEQ ID HO: 3} 

20 wherein: 

Xaa at position 7 is: L -histidine , D-histidine, 

desamino-hi s tiding , 2-amino-histidine, {J-hydroxy- 

histidine, homohistidine, a-fiuoromethyl-histidine or 

a-raethyl -histidine ; 
25 Xaa at position 8 is: Gly, Ala, Val, Leu, He, Ser, or 

Thr; 

Xaa at position 9 is: Thr, Ser, Arg, Lys, Trp, Phe, 

Tyr, Glu, or His; 
Xaa at position 11 is: Asp, Glu, Arg, Thr, Ala, Lys, or His; 
30 Xaa at position 12 is; His, Trp, Phe, or Tyr; 

Xaa at position 16 is; Leu, Ser, Thr, Trp, His, Phe, Asp, 

Val, Tyr, Glu, or Ala; 
Xaa at position 18 is: His, Pro, Asp, Glu, Arg, Ser, Ala, or 

Lys; 

35 Xaa at position IS is- Gly, Asp, Glu, Gin, Asa, Lys, Arg, or 
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Cys; 



Xaa 


at position- 23 


is: 


His, 


Asp, 


Lys, 


Glu, 


Gin, 


or Arg; 


Xaa 


at position 24 


is : 


Glu, 


Arg, 


Ala, 


or Lys; 






Xaa 


at position 26 


is : 


Trp, 


Tyr, 


Phe, 


Asp, 


Lys, 


Glu, 


or His- 


Xaa 


at position. 27 


is: 


Ala, 


Glu, 


His, 


Phe, 


Tyr , 


Trp, 


*rg or 




Ly s ; 


















Xaa 


at position 30 


is: 


Ala, 


Glu, 


Asp, 


Ser , 


or His; 




Xaa 


at position 31 


is: 


Asp, 


Glu, 


Ser, 


Thr , 


Arg, 


Trp, 


or Lys; 




at position 33 


is: 


Asp, 


Arg , 


Val , 


Lys, 


Ala, 


Gly, 


or Glu; 


Xaa 


at position 34 


is; 


Glu, 


Lys, 


or Asp; 








Xaa 


at position 35 


i s : 


Thr, 


Ser, 


Lys, 


Arg, 


Trp, 


Tyr, 


Phe, 




Asp, Gly, Pro, 


His, or 


Glu; 












Xaa 


at position 36 


is: 


Thr, 


Ser , 


Asp, 


Trp, 


Tyr, 


Phe, 


Arg , 




Glu, or His; 


















Xaa 


at position 37 


is: 


Lys, 


Arg, 


Thr, 


Ser, 


Glu, 


Asp, 


Trp, 



■Tyr, Phe, His, Gly, Giy-Pro, or is deleted. 



6. The heterologous fusion protein of Claims 1, 2, or 3 
wherein the GLP-1 compound comprises the sequence of formula 
20 III {SSQ ID NO: 4) : 

7 8 9 10 XI 12 13 14 15 16 1? 
Xaa-Xaa-Glu-Gly-Xaa-Xaa-Thr-Ser-Asp-Xaa-Ser- 
IS 19 20 21 22 23 24 25 25 27 28 
Ser- Tyr - L eu -Glu -Xaa - Xaa -Xaa- Ai a -Xaa- Xaa- Phe - 
25 29 30 31 32 33 34 35 36 37 

Ile--Aia-Xaa-Leu~xaa~Xa&~Xaa~Xaa-Xaa 
formula III (SEQ ID NO: 4} 

wherein: 

30 Xaa at position 7 is: L-histidina, D-bistidine, desasnino- 
his tidine , 2 -axnino-histi&bae , jii-hydroxy-hi s tidine , 
homohis tidine, a-f luoromethyl-histidina or a-roethyl- 
histidine? 

Xaa at position 8 is: Gly, Ala, Val, Leu, lie, Ser, or Thr; 
35 Xaa at position 11 is: Asp, Glu, Arg, Thr, Ala, Lys, or His; 
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Xaa at position 12 is: His, Trp., Phe, or Tyr; 

Xaa at position 16 is: Leu, Ser, Thr, Trp, His, Phe, Asp, 

Val, Glu, or Ala; 
Xaa at position 22: Gly, Asp, Giu, Gin, Asa, Lys, Arg, or 
5 Cys ; 

Xaa at. position 23 is: His, Asp, Lys, Gin, or Gin; 
Xaa at position 24 is: GXu, His, Ala, or Lys; 
Xaa at position 25 is: Asp, Lys, Glu, or His; 
Xaa at position 27 is: Ala, Glu, His, Phe, Tyr, Trp, Arg, 
10 or Lys; 

xaa at position 30 is: Ala, Glu, Asp, Ser, or His; 
Xaa at position 33 is: Asp, Arg, Val, Lys, Ala, Gly, or 
Glu ; 

Xaa at position 34 is: Glu, Lys, or Asp; 
15 Xaa at position 35 is: Thr, Ser, Lys, Arg, Trp, Tyr, Phe, 

Asp, Gly, Pro, His, or Glu; 
Xaa at. position 36 is: Arg, Glu, or His; 
Xaa at position 37 is: Lys, Arg, Thr, Ser, Glu, Asp, Trp, 

Tyr, Phe, His, Gly, Gly- Pro, or is deleted, 

20 

7. The heterologous fusion protein of Claim 1, 2, or 3 
wherein the GLP-1 compound comprises the sequence of formula 
IV (SEQ ID HO: 5) : 

7 8 9 10 11 12 13 14 15 16 17 
2 5 Xaa - Xaa -Glu-Gly-Thr -Xaa -Thr - Ser - Asp -Xaa - S er -- 

13 19 20 21 22 23 24 25 26 27 28 
Ser-^r-Leu-Glu-Xaa-Xaa-Ala-Ala-Xaa-Glu-Phe- 
29 30 31 32 33 34 35 36 37 
1 1 e - Xaa-Trp-Leu-Va 1 - Lys -Xaa - Arg -Xaa 
30 formula IV I SEQ ID NO: 5} 



wherein: 

Xaa at position 7 is: L-histidine, D-histidine, desamino- 
iiis tidine , 2 -amino-histidine , 0-hydroxy-hi s t i d me , 
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homohistidine , a-fluoromethyl-histidins or a~methyl- 
histidine; 

Xaa at position 8 is: Gly, Ala, Val, Lea, He, Ser, or Thr; 
Xaa at position 12 is: His f Trp, phe, or Tyr; 
5 Xaa at position 16 is: Leu, Ser, Thr. Trp, His, phe, Asp, 

Val, Glu, or Ala; 
Xaa at position 22 is: Gly, Asp, Glu, Gin, Asn, Lys, Arg, or 

Cysf 

Xaa at position 23 is: His, Asp, Lys, Glu, or Gin; 
10 Xaa at position 26 is: Asp, Lys, Gin. or His; 

Xaa at position 30 is-. Ala, Glu, Asp, Ser, or His; 

Xaa at position 35 is: Thr, Ser, Lys, Arg, Trp, Tyr, Phe, 

Asp, Gly, Pro, His, or Glu; 
Xaa at position 37 is: Lys, Arg, Thr, Ser, Glu, Asp, Trp, 
15 Tyr, Phe, His, -NH 2 , Gly, Gly-Pro, or Gly-Pro-JJH 2 , or is 

deleted. 

8. The heterologous fusion protein of Claims 1, 2, or 3 
wherein the GLP-1 compound comprises the sequence of formula 
20 V {SEQ ID NO: 6) 

7 8 9 10 11 12 13 14 15 16 17 
Xaa-Xaa-GI .u-Gly-Thr - Phe-Thr- Ser ~Asp~Val -Ser- 
18 19 20 21 22 23 24 25 26 27 28 
25 Ser-Tyr-Leu-Glu-Xaa-Xaa-Ala-Aia-Lys-Xaa-Phe™ 
29 30 31 32 33 34 35 36 37 
I le~Xaa~Trp~Leu~Val -Lys -Gly- Arg- Xaa 
formula V (SEQ ID HO: 6} 

3 0 wherein : 

Xaa at position 7 is: L-histidine, D-histidine, desaiaino- 
histidine , 2-amino-histidine, ^-hydroxy -hi s tidine , 
homoMstidine, a-f Itioromethyl-histidine or a-methyl- 
histidine; 

35 Xaa at position 8 is: Gly, Ala, Val, Leu, He, Ser, or Thr; 
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Xaa at position. 22 is: Gly, Asp, Glu, Gin, Asn, Lys, Arg, or 
Cys ; 

Xaa at position 23 is:. His, Asp, Lys, Gin, or Gin; 
Xaa at position 2 4 is: Ala, Gin, His, Phe, Tyr, Trp, Arg, 
5 or Lys 

Xaa at position 30 is: Ala, Glu, Asp, Ser, or His; 

Xaa at position 37 is: Lys, Arg, Thr, Ser, Glu, Asp, Trp, 

Tyr, Phe, His, Gly, Gly- Pro, or is deleted. 

10 9. The heterologous fusion protein of Claims 1, 2, or 3 

wherein the GLP-1 compound comprises the sequence of formula 
VIII {SEQ ID NO: 11} . 

7 8 9 10 .11 12 13 14 15 16 17 
15 Xaa-xaa-Glu-Gly-Thr-Phe-Thr-Ser-Asp-Xaa-Ser- 
13 19 20 21 22 23 24 25 26 27 28 
Xaa-Xaa-Xaa~Glu-Xaa-xaa~Ala-xaa~xaa-xaa-Phe- 
29 30 31 32 33 34 35 36 37 
Ile-Xaa-Trp-Leu~Xaa~Xaa-Giy-Xaa-Xaa 
20 formula VIII (SEQ id NO: 11) 

wherein : 

xaa at position 7 is: L-histidine, D-histidin®, desamino- 
histidine, 2-amino-histidine, 0~hydroxy~Ms tidine , 
25 homohisticUne, a--f luoromethyl-histidine or a-methyl- 

histidine; 

Xaa at position 8 is: Gly, Ala, or Val; 

Xaa at position 16 is: Leu or Val; 

Xaa at position 18 is Lys or Ser; 
30 Xaa at position 19 is: Gin or Tyr; 

Xaa at position 20 is: Met or Leu; 

Xaa at position 22 is: Glu or Gin; 

Xaa at position 23 is: Glu, or Gin; 

Xaa at position 25 is: Val or Ala; 
35 Xaa at position 26 is: Arg or I.ys; 
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Xaa at position 27 is Leu or Gin; 
Xaa at position 30 is: Glu or Ala; 
Xaa at position 33 is- Val or Lys; 
Xaa at position 34 is: Asn or Lys; 
5 Xaa at position 36 is: Gly or Arg; and 

Xaa at position 37 is; Gly, Pro, Pro-Ser-Ser-Gly-Ala-Pro- 
Pro-Pro-Ser, or is absent. 

10 . The heterologous fusion protein of claims 1 through 9 
10 wherein the GLP-l compound has no more than 6 amino acids 

that are different from the corresponding amino acid in GLP- 
1(7-37>0H, GLP-l (7-36) OH, or Exendin-4 . 

11. The heterologous fusion protein of Claim 10 wherein the 
15 GLP-l compound has no more than 5 amino acids that differ 

from the corresponding amino acid in GLP-l (7-37) OH, GLP-l (7- 
36} OH, or Exendin-4 . 

12 . The heterologous fusion protein of Claim 11 wherein the 

2 0 GLP-l compound has no more than 4 amino acids that differ 

from the corresponding amino acid in GLP-l (7-37) OH, GLP-l {7- 
36 } OH, or Exendin-4. 

13 . The heterologous fusion protein of Claim 12 wherein the 
25 GLP-l compound has no more than 3 amino acids that differ 

from the corresponding amino acid in GLP-l {7-37 ) OH, GLP-l (7- 
36} OH, or Exendin-4. 

14. The heterologous fusion protein of Claim 13 wherein the 

3 0 GLP-l compound has no more than 2 amino acids that differ 

from the corresponding amino acid in GLP-l (7 -37} OH, GLP-l (7- 
36) OH, or Exendin-4. 

15. The heterologous fusion protein of any one of Claims 1 
3 5 through 14 wherein Xaa at position 8 is glycine or valine. 
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16. The heterologous fusion protein of Claim 8 wherein the 
GLP-1 compound has no more than 2 amino acids that differ 
from the corresponding amino acid in GLP-1 (7-37) OH or GLP-- 

5 1 {7-36} OH and Xaa at position 8 is glycine or valine and Xaa 
at position 30 is alanine, glutamic acid, aspartic acid, 
serine , or histidine. 

17 . The heterologous fusion protein of Claim 16 wherein Xaa 
10 at position 30 is Glutamic acid. 

IS. The heterologous fusion protein of Claim 8 wherein the 
GLP-1 compound has no more than 2 amino acids that differ 
from the corresponding amino acid in GLP-1 {7-37 ) OH or GLP- 
15 1(7-36} OH and Xaa at position 8 is glycine or valine and Xaa 
at position 37 is histidine, phenylalanine, tyrosine, 
tryptophan, aspartic acid, glutamic acid, serine, threonine, 
arginine, or lysine. 

20 19. The heterologous fusion protein of Claim 18 wherein Xaa 
at position 37 is histidine. 

20. The heterologous fusion protein of Claim 8 -wherein the 
GLP-1 compound has no more than 2 amino acids that differ 

25 from the corresponding amino acid in GLP - 1 (7 -37 ) OH or GLP- 

1(7-36) OH and Xaa at position 8 is glycine, valine, leucine, 
isoieucine, serine, threonine, or methionine and Xaa at 
position 22 is aspartic acid, glutamic acid, lysine, 
arginine , asparagixie, glutamics or histidine. 

30 

21. The heterologous fusion protein of Claim 20 wherein Xaa 
at position 22 is lysine or glutamic acid. 

22 . The heterologous fusion protein of Claim 8 wherein the 
35 GLP-1 compound is Vai 8 -GLP-1 {7-37) . 
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23 . The heterologous fusion protein, of Claim 8 wherein the 
GLP-1 compound is Gly s »GLF~i (7-37) . 

5 24. The heterologous fusion protein of any one of Claims 1 
through 23 wherein the second polypeptide is human albumin. 

25. The heterologous fusion protein of claim 24 wherein the 
second polypeptide has the sequence of SSQ ID NO: 34. 

10 

26. The heterologous fusion protein of any one of Claims 1 
through 23 wherein the second polypeptide is an H~ terminal 
fragment of albumin. 

15 27. A heterologous fusion protein comprising a first 

polypeptide with a N- terminus and a C- terminus fused to a 
second polypeptide with a N- terminus and a C- terminus 
wherein the first polypeptide is a GLP-1 compound and the 
second polypeptide is selected from the group consisting of 

20 a} the Fc portion of an immunoglobulin; 

b) an analog of the Fc portion of an immunoglobulin; 
and 

c) fragments of the Fc portion of an immunoglobulin, 
and wherein the C- terminus of the first polypeptide is fused 

25 to the N~ terminus of the second polypeptide. 

28. h heterologous fusion protein comprising a first 

polypeptide with a N- terminus and a C- terminus fused to 
a second polypeptide with a n- terminus and a C- terminus 
30 wherein the first polypeptide is a GLP-1 compound and the 

second polypeptide is selected from the group consisting of 
a} the Fc portion of an immunoglobulin; 
b) an analog of the Fc portion of an immunoglobulin? 
and 

35 
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c) fragments of the Fc portion of an immunoglobulin, 
and wherein the C- terminus of the first polypeptide is 
fused to the N- terminus of the second polypeptide via a 
5 peptide linker. 

29, The heterologous fusion protein of the Claim 28 wherein 
the peptide linker is selected iron the group consisting of: 

a) a glycine rich peptide; 
10 b) a peptide having the sequence [Gly~Gly~Gly ■Gly-Ser]- i 

where n is 1, 2, 3, 4, 5, or 6; and 
c) a peptide having the sequence [Gly-Gly-Gly-Gly-Serj 3 . 

30. The heterologous fusion protein of any one of Claims 
15 27, 28, or 29 wherein the GLP-1 compound comprises the 

sequence of formula 1 [SEQ ID MO: 2]. 

7 8 9 10 II 12 13 14 15 16 17 
His - Xaa-Xaa-Gly -Xaa~Fhe-Thr-Xaa - Asp-Xaa~ Xaa- 

20 18 13 20 21 22 23 24 25 26 27 28 

x aa - Xaa - Xa a - Xaa - Xaa-Xaa~Xaa-Xaa- Xaa~Xaa~ Phe- 
29 30 31 32 33 34 35 36 37 33 39 
Xle-Xaa-Xaa-Xaa-Xaa-Xaa-Xaa-Xaa-Xaa-Xaa-Xaa- 
40 41 42 43 44 45 

2 5 Xaa~Xaa-Xaa~Xaa-Xaa~Xaa 

Formula I (SEQ ID HO : 2} 

wherein: 

Xaa at position 8 is Ala, Gly, Ser, Thr, Leu, lie, Val, 
30 61 u, Asp, or Lys; 

Xaa at position 9 is Glu, Asp, or Lys; 

Xaa at position 11 is Thr, Ala, Gly, Ser, Leu, He, Val, 

Glu, Asp, or Lys; 
Xaa at position 14 is Ser, Ala, Gly, Thr, Leu, lie, Val, 
35 Glu, Asp, or Lys; 
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Xaa at position 16 is Val., Ma, Gly, Ser, Thr, Leu, He, 

Tyr, Glu, Asp, Trp, or Lys; 
Xaa at position 17 is Ser, Ala, Gly, Thr, Leu, lie, Val, 

Glu, Asp, or Lys; 
Xaa at position 18 is Ser, Ala, Gly, Thr, Leu, He., Val, 

Glu, Asp, Trp, Tyr, or Lys; 
Xaa at position 1.9 is 'pyx, Phe, Trp, Glu, Asp, Gin, or Lys? 
Xaa at position 20 is Leu, Ala, Gly, Ser, Thr, He, Val, 

Glu, Asp, Met, Trp, Tyr, or Lys; 
Xaa at position 21 is Glu, Asp, or Lys; 

Xaa at position 22 is Gly, Ala, Ser, Thr, Leu, He, Val, 

Glu , Asp , or Lys ; 
Xaa at position 23 is Gin, Asn, Arg, Glu, Asp, or Lys; 
Xaa at position 24 is Ala, Gly, Ser, Thr, Leu, lie, Val, 

Arg, Glu, Asp, or Lys; 
Xaa at position 25 is Ala, Gly, Ser, Thr, Leu, lie, Val, 

Glu, Asp, or Lys; 
Xaa at position 26 is Lys, Arg, Gin, Glu, Asp, or His; 
Xaa at position 27 is Leu, Glu, Asp, or Lys; 
Xaa at position 30 is Ala, Gly, Ser, Thr, Leu, lie, Val, 

Glu, Asp, or Lys; 
Xaa at position 31 is Trp, Phe, Tyr, Glu, Asp, or Lys; 
Xaa at position 32 is Leu, Gly, Ala, Ser, Thr, He, val, 

Glu, Asp, or Lys; 
Xaa at position 33 is Val, Gly, Ala, Ser, Thr, Leu, lie, 

Glu, Asp, or Lys; 
Xaa at position 34 is Asn, Lys, Arg, Glu, Asp, or His; 
Xaa at position 35 is Gly, Ala, Ser,- Thr, Leu, lie, Val, 

Glu, Asp, or Lys; 
xaa at position 36 is Gly, Arg, Lys, Glu, Asp, or His; 
Xaa at position 3^ is Pro, Gly, Ala, Ser, Thr, Leu, He, 

Val, Glu, Asp, or Lys, or is deleted; 
Xaa at position 38 is Ser, Arg, Lys, Glu, Asp, or His, or 

is deleted ; 

Xaa at. position 35 is Ser, Arg, Lys, Glu, Asp, or His, or 
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is deleted; 

Xaa at position 40 is Gly, Asp, Glu, or Lys, or is deleted; 
Xaa at position 41 is Ala, Phe, Trp, Tyr, Glu, Asp, or Lys, 
or is deleted; 

5 Xaa at position 42 is Ser, Pro, Lys, Glu, or Asp, or is 
deleted; 

Xaa at position 43 is Ser, Pro, Glu, Asp, or Lys, or is 
deleted; 

Xaa at position 44 is Gly, Pro, Glu, Asp, or Lys, or is 
10 deleted ; 

and 

Xaa at position 45 is Ala, Ser, Val, Glu, Asp, or Lys, or is 
deleted; 

provided that when the amino acid at position 37, 33, 39, 
15 40, 41, 42, 43, or 44 is deleted, then each amino acid 
downstream of that amino acid is also deleted. 



31. The heterologous fusion protein of Claims 27, 28, or 29 
wherein the GLP-l compound comprises the sequence of formula 
20 II ( SEQ ID NO: 3) : 



7 8 .9 10 11 12 13 14 15 IS 17 
Xaa~Xaa-Xaa-Gly~Xaa~Xaa-Thr-Ser-Asp-Xaa-Ser- 
18 19 20 21 22 23 24 25 26 27 28 
2 5 Xaa - Tyr - Leu- G lu-xaa-Xaa- Xaa -Ala - Xaa- Xaa- Phe - 

29 30 31 32 33 34 35 36 37 
Ile-Xaa-Xaa-Leu-Xaa-Xaa-Xaa-Xaa-Xaa 
Formula II {SEQ ID HO; 3) 

30 wherein: 

Xaa at position 7 is: L-histidine, P-histidine, desamino- 
his tidine, 2~amino-his tidine , f>-h,yd:roxy~hist idine , 
homohls tidine, a~f iuoromethyl -his tidine or ce-methyl- 
histidine; 

35 Xaa at position 8 is: Gly, Ala, Val, Leu, lie, Ser, or Thr; 
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xaa 


at position 9 
Glu, or His; 


is: 


Thr, 


Ser, 


Arg, 


Lys , 


Trp , 


Phe, 


Tyr , 




Xaa 


a t. position 11 


is- 


Asp 


Glu, 


Arg 


Tbr, 


Ala, 


Lys; , 


or His; 




Xaa 


at position 12 


is- 


His, 


Trp , 


Phe, 


or Tyr; 






5 


Xaa 


at position 16 
Val, Tyr , Glu 


is- 


Leu, 
Ala; 


Ser, 


Thr, 


Trp, 


His, 


Phe, 


Asp, 




Xaa 


at position 18 
Lys; 


' ±T> 


His,. 




Asp, 


Gin, 


Arg, 


Ser, 


Ala , or 




Xaa 


at posit i on 19 


is- 


Gly, 


Asp, 


Glu, 


Gin, 


As n, 


Lys, 


Arg , or 


10 




Cys ; 




















Xaa 


at position 23 


is- 


His, 


Asp, 


L s 


Glu, 


Gin, 


or Arg; 




Xaa 


at position 24 


is* 


Glu, 




Ala, 


or L 


ys; 








Xaa 


at position 26 


is : 


Trp, 


Tyr, 


Phe, 


Asp, 


Lys, 


Glu, 


or His; 




Xaa 


at position 27 


is : 


Ala, 


Glu, 


His, 


Phe, 


Tyr, 


Trp, 


Arg , or 


15 




Lys ; 




















Xaa 


at position 30 


is: 


Ala, 


Glu, 


Asp, 


Ser, 


or His; 






Xaa 


at position 31 


is; 




Glu, 


Ser, 


Thr, 


Arg, 


Trp, 


or Lys; 




Xaa 


at position 33 




Asp' 




Val, 


Lys, 


Ala, 


Gly, 


or Glu; 




Xaa 


at position 34 


is: 


Glu, 


Lys , 


or Asp,* 








20 


Xaa 


at position 35 
Asp, Gly, Pro 


is: 
, Hi 


Thr , 


Ser, 
Glu; 


Lys, 


Arg, 


Trp, 


Tyr, 


Phe , 




Xaa 


at position 36 
Glu, or His; 


is: 


Thr, 


Ser, 


Asp, 


Trp , 


Tyr 


Phe, 


Arg, 




Xaa 


at position 37 


is: 


Lys, 


Arg, 


Thr, 


Ser , 


Glu, 


Asp, 


Trp, 



25 Tyr, Phe, His, Gly, Gly-Pro, or is deleted. 



32, The heterologous fusion protein of Claims 27, 28, or 29 
wherein the GLP-1 compound comprises the sequence of formula 
III (SEQ ID NO; 4} : 
30 7 8 9 10 II 12 13 14 15 16 17 

Xaa-Xaa-Glu-Gly-Xaa-Xaa-Thr-Ser-Asp-Xaa-Ser" 
18 19 20 21 22 23 24 25 26 27 28 
Ser -Tyr-Leu -Gl u -Xaa -Xaa -Xaa - Ala-Xaa-Xaa -Phe- 
29 3 0 31 32 33 34 35 3 6 37 
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formula III (SEQ ID NO: 4} 

wherein : 

Xaa at position 7 is: L-histidine, D-histidine, desamino- 
5 histidine, 2 - -amino -histidine, ^-hydroxy-hi s t idine , 

horaohistidine, ct-f iuoromethyi -histidine or ot-methyl- 
histidine; 

Xaa at position 8 is: Gly, Ala, Val, Lev., lie, Ser, or Thr; 
Xaa at position 11 is: Asp, Glu, Arg, Thr, Ala, Lys, or His; 
10 Xaa at position 12 is: His, Trp, Phe, or Tyr; 

Xaa at position 16 is: Leu, Ser, Thr, Trp, His, Phe, Asp, 

Val, Glu, or Ala; 
Xaa at position 22: Gly, Asp, Glu, Gin, Asrt, Lys, Arg, or 

Cys; 

15 Xaa at position 23 is: His, Asp, Lys, Glu, or Gin; 
Xaa at position 24 is: Glu, His, Ala, or Lys; 
Xaa at position 25 is: Asp, Lys, Glu, or His; 
Xaa at position 27 is: Ala, Glu, His, Phe, Tyr, Trp, Arg, 
or Lys; 

20 Xaa at position 30 is: Ala, Glu, Asp, Ser, or His; 

Xaa at position 33 is: Asp, Arg, Val, Lys, Ala, Gly, or 
Glu ; 

Xaa at position 3 4 is: Glu, Lys, or Asp; 

Xaa at position 35 is: Thr, Ser, Lys, Arg, Trp, Tyr, Phe, 
55 Asp, Gly, Pro, His, or Glu; 

Xaa at position 3 5 is: Arg, Glu, or His; 

Xaa at position 37 is: Lys, Arg, Thr, Ser, Glu, Asp, Trp, 
Tyr, Phe, His, Gly, Gly-Pro, or is deleted. 

30 33. The heterologous fusion protein of Claim 27, 28, or 29 
wherein the GLP-1 coinpound comprises the sequence of formula 
IV { S'EQ ID NO: 5}: 

1 S 9 10 11 12 13 14 15 16 17 
Xaa-Xaa-Glu-Gly-Thr-Xaa -Thr- Ser- Asp- Xaa-Ser- 
35 18 19 20 21 22 23 24 25 2b 27 28 
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Ser-Tyr-Leu-GIu-Xaa-Xaa-Ma-Aia-Xaa-Glu-Phe- 
29 30 31 32 33 34 35 36 37 
Ile-Xaa~Trp-Leu-Val~Lys-Xaa-Arg~Xaa 
formula IV {SEQ ID NO: 5) 

5 

wherein : 

Xaa at position 7 is: L-histidine, D-hist.id.ine, desamino- 
histidine, 2~amino-his tidine, ^-hydroxy -hi stidine, 
homohi stidine, a- f luoromethyi -his tidine or a-raethyl~ 
10 hi stidine; 

Xaa at position 8 is: Gly, Ala, Val, Leu, lie, Ser, or Thr; 

Xaa at position 12 is: His, Trp, Phe, or Tyr; 

Xaa at position 16 is; Leu, Ser, Thr, Trp, His, Phe, Asp, 
Val, Glu, or Ala; 
15 Xaa at position 22 is: Gly, Asp, Glu, Gin, Asn, Lys, Arg, or 
Cys; 

Xaa at position 23 is: Eis, Asp, Lys, Glu, or Gin,* 
Xaa at position 26 is: Asp, Lys, Glu. or His; 
Xaa at position 30 is: Ala, Glu, Asp, Ser, or His; 
20 Xaa at position 35 is: Thr, Ser, Lys, Arg, Trp, Tyr, Phe, 
Asp, Gly, Pro, His, or Glu; 
Xaa at position 37 is; Lys, Arg, Thr, Ser, Glu, Asp, Trp, 
Tyr, Phe, His, Gly, Gly- Pro, or is deleted. 

25 34. The heterologous fusion protein of Claims 27, 28, or 2.9 
wherein the GLP-1 corcpound comprises the sequence of formula 
V (3SQ ID NO: 6) 

7 8 9 10 11 12 13 14 15 16 17 
3 0 Xaa-Xaa-Glu-Gly-Thr~Phe~Thr-Ser-Asp-Val~Ser- 
IS 19 20 21 22' 23 24 25 26 27 28 
S er - Tyr-Leu-Glu-Xaa -Xaa -Ala -Ala- Lys -Xaa - Phe - 
29 30 ' 31 32 33 34 35 36 37 
Ile-Xaa-Trp-Leu-Val-Lys-Gly-Arg-Xaa 
35 formula V (SEQ ID NO: 6) 
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wherein: 

Xaa at position. 7 is: L-his tidine, P-histidine, desamino- 
histidine., 2 -ainino-his tidine , P -hydroxy- his tidine, 
5 homohis tidine, rx-f iuoromethyl -histidine or a-methyl- 

his tidine; 

Xaa at position 8 is: Gly, Ala, Val, Leu, He, Ser, or Thr; 
Xaa at position 22 is: Gly, Asp, Glu, Gin, Asn, Lys, Arg, or 
Cys ; 

10 Xaa at position 23 is: His, Asp, Lys, Glu, or Gin; 

Xaa at position 24 is: Ala, Glu, His, Pile, Tyr, Trp, Arg, 
or Lys 

Xaa at position 30 is: Ala, Glu, Asp, Ser, or His; 
Xaa at position 37 is; Lys, Arg, Thr, Ser, Glu, Asp, Trp, 
15 Tyr, Phe, His, Gly, Gly- Pro, or is deleted. 

35. The heterologous fusion protein of Claims 27, 28, or 29 
wherein the GLP-1 compound comprises the sequence of formula 
VI II (SEQ ID SO: 111 . 

20 

7 8 9 10 11 12 13 14 15 16 17 
xaa -Xaa-Glu~6ly~Thr - Phe- Thr- Ser - Asp-Xaa- Ser- 
IB 19 20 21 22 23 24 25 26 27 28 
Xaa-Xaa-Xaa~Glu-Xaa~Xaa-Ala-Xaa-Xaa--Xaa-Phe~ 
25 29 30 31 32 33 34 35 35 37 

Ile-Xaa -Trp -Leu-Xaa -Xaa- G ly -Xaa - R 
formula VIII (SEQ ID NO: 11} 

wherein: 

30 Xaa at position 7 is: L-hlsfcidine, D-his tidine, desamino- 
histidin©, 2~amino~histidine, £-hydro:<y~his tidine, 
homohis tidine, <x-f luoromethyl -histidine or a-methyl- 
histidine; 

Xaa at position 8 is: Gly, Ala, or Val; 
35 Xaa at position 16 is: Leu or Val; 
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Xaa 


at position 18 


is 


Lys or Ser; 


Xaa 


at position 19 


is; 


Gin or Tyr; 


Xaa 


at position 


20 




Met or Leu; 


Xaa 


at position 


22 


H- 


Gin or Gin; 


Xaa 


a t po s i fc i o:n 


23 




: Glu, or Gin; 


Xaa 


at position 


25 


is 


: Val ox Ala; 




at position 


26 


is 


; Arg or Lys; 


Xaa 


at position 


27 




Leu or Glu; 


Xaa 


at position 


30 


is 


: Glu or Ala; 


xaa 


at position 


33 


is 


: Val or Lys; 


Xaa 


at position 


34 


is 


: Asn or Lys; 


Xaa 


at position 


36 


is: Gly or Axxj; and 


Xaa 


at position 


37 


is 


: Gly, Pro, Pro-Ser- 


Pro-Pro-Ser, 


or 


is 


absent . 



15 

36. The heterologous fusion protein of claims 27 through 35 
wherein the GLP-1 compound has no more than 6 amino acids 
that are different from the corresponding amino acid in GLP- 
I (7 -37 } OH, GLP-1 {7-36} OH, or Exendin-4. 

20 

37. The heterologous fusion protein of Claim 36 wherein the 
GLP-1 compound has no more than 5 amino acids that differ 
from the corresponding amino acid in GLP-1 (7-37} OH, GLP-1 (7- 
36) OH, or Exendin-4. 

25 

38. The heterologous fusion protein of Claim 37 wherein the 
GLP-1 compound has no more than 4 amino acids that differ 
from the .corresponding amino acid in GLP-1 (7-37 } OH, GLP-1 {7~ 
3 6} OH, or Exendin-4. 

30 

39 . The heterologous fusion protein of Claim 38 wherein the 
GLP-1 compound has no more than 3 amino acids that differ 
from the corresponding amino acid in GLP-1 (7-37) OH, GLP-1 (7- 
3 6) OH, or Exendin-4. 



35 
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40. The 'heterologous fusion protein of Claim 39 wherein the 
GLP-1 compound has no more than 2 amino acids that differ 
from the corresponding amino acid in GLP-1 {7 -37 } OH, GLP-1 (7- 
36) OH, or Bxendi.n-4. 

5 

41. The heterologous fusion protein of any one of Claims 27 
through 40 wherein Xaa at position S is glycine or valine. 

42. The heterologous fusion protein of Claim 34 wherein the 
10 GLP-1 compound has no more than 2. amino acids that differ 

from the corresponding amino acid in GLP-1 < 7 -37 } OH or GLP-- 
!{7-36}OH and Xaa at position 8 is glycine or valine and Xaa 
at position 30 is alanine, glutamic acid, aspartie acid, 
serine, or histidine. 

15 

43 . The heterologous fusion protein of Claim 42 wherein Xaa 
at position 30 is Glutamic acid. 

44. The heterologous fusion protein of Claim 34 wherein 
20 the GLP-1 compound has no more than 2 amino acids that 

differ from the corresponding amino acid in GLP-1 (7-37) OH or 
GLP-1 (7-36) OH and Kaa at position 8 is glycine or valine and 
Xaa at position 37 is histidine, phenylalanine, tyrosine, 
tryptophan, aspartie acid, glutamic acid, serine, threonine, 
25 arginine, or lysine. 

45 . The heterologous fusion protein of Claim 44 wherein Xaa 
at position 37 is histidine. 
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46, The heterologous fusion protein of Claim 34 wherein the 
GLP-1 compound has no more than 2 arai.no acids that differ 
from the corresponding amino acid in GLP-1 (7—37) OH or GLP- 
5 1(7- 36) OH and Xaa at position 8 is glycine, valine, leucine, 
i so leucine, serine, threonine, or methionine and Xaa at 
position 22 is asparfcic acid, glutamic acid, lysine, 
arginine, asparagine, glutamine or histidine. 

10 47. The heterologous fusion protein of Claim 4 5 wherein Xaa 
at position 22 is lysine or glutamic acid. 



48, The heterologous fusion protein of Claim 34 wherein the 
GLP-1 compound is Val-8-GLP-l {7-37 ) . 

49. The heterologous fusion protein of Claim 34 wherein the 
GLP-1 compound is Gly~8-GLP~1 {7-37} . 



50. The heterologous fusion protein of any one of Claims 27 
20 through 49 wherein the second polypeptide is the Fc portion 

of an lg selected from the group consisting of: IgGl, XgG2 , 
IgG3, XgG4, IgE, IgA, IgD, or IgM. 

51. The heterologous fusion protein of any one of Claims 27 
25 through 50 wherein the second polypeptide is the Fc portion 

of an Ig selected from the group consisting of: IgGl, IgG2 , 
IgG3, and XgG4, 



52. The heterologous fusion protein of Claim 51 wherein the 
30 second polypeptide is the Fc portion of an IgGl 

immunoglobulin . 

53, The heterologous fusion protein of Claim 51 wherein the 
second polypeptide is the Fc portion of an IgG4 

3 5 immuno g 1 obui in . 
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54. The heterologous fusion protein of Claims 50 through 53 
wherein the igG is htiraan . 

5 55, The heterologous fusion protein of any one of Claims 27 
through 54 wherein the Fc portion comprises the hinge, CH2 , 
arid CH3 domains. 

56. The heterologous fusion protein of Claim 52 wherein the 
10 second polypeptide has the sequence of SEQ ID NO: 32, 

57. A polynuc 1 eo t ide encoding a heterologous fusion protein 
of any one of Claims 1 through 56. 

15 58, A vector comprising the polynucleotide of Claim 57. 

59. A host cell comprising the vector of Claim 58. 

60. A host cell expressing at least one heterologous fusion 
20 protein of any one of Claims 1 through 56. 

61. The host ceil of Claim 60 wherein said host cell is a 
CHO ceil. 

25 62. A process for producing a heterologous fusion protein 
comprising the steps of transcribing and translating a 
polynucleotide of Claim 57 under conditions wherein the 
heterologous fusion protein is expressed in detectable 
amounts . 

30 

63. A method for normalising blood glucose levels in a 
mammal in need thereof comprising the administration of a 
therapeutically effective amount of the heterologous fusion 
protein of any one of Claims 1 through 56. 

35 
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64 . A method of treating a patient with, non-insulin 
dependent diabetes nielli tus comprising the administration of 
a therapeutically effective amount of the heterologous 
fusion protein of any one of Claims 1 through 56. 

5 

65 . A method of treating obesity comprising the 

admirri s era t ion of a therapeutically effective amount of the 
heterologous fusion protein of any one of Claims 1 through 
56. 

10 

66. The use of a heterologous fusion protein as claimed in 
any one of Claims 1 through 56 for the manufacture of a 
medicament for the treatment of patients with non- insulin 
dependen t diabetes mel 1 i tu s , 

15 

67 . The use of a heterologous fusion protein as claimed in 
any one of Claims 1 through 56 for the manufacture of a 
medicament for the treatment of patients with obesity. 

20 68. A pharmaceutical formulation adapted for the treatment of 
patients with non-insulin dependent diabetes comprising a 
heterologous fusion protein of any one of Claims .1 
through 56. 

25 6.9. The heterologous fusion protein of Claims 1, 2, or 3 
wherein the GLP-1 compound comprises the sequence of 
formula XX [SEQ ID NO: 12] 

Xaa 7 -Xaa 3 - Glu •• Gly - Thr-Xaa i2 -Thr - S er -Asp-Xaa x6 - Ser - 
30 Xa &% s - Xaa3.9 -Xaa.20 -Glu-Xaa.22 - Gin- A l a- X aa 25 - hy s - Xaa 2 -j - 

Phe- Ile-Xaa 3 f>-Trp-Leu~Xaa 3 3 -Lys~Gly ~Arg-Xaa 37 
Formula IX <SEQ ID NO; 12} 

wherein; 

35 xaa 7 is: L-histidine, D~histidine f desaroino-histidine, 2- 
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amino -hi s fcidine , {3 -hydr axy-hi s ti d iiie , 
homohistidine, a-fluoromethyi-histidirAe, or ex- 
methyl -hi s t idine ; 

Xaag is: Ala, Gly, Val, Leu, He, Ser, or Thr; 
5 Xaai2 is: Phe, Trp, or Tyr; 

Xaaxs is: Val, Trp, lie, Leu, Phe, or Tyr; 

Xaa is is: Ser, Trp, Tyr, Phe, Lys, lie, Leu, Val; 

Xaa 19 is: Tyr, Trp, or Phe; 

Xaa ;;0 is; Leu, Phe, Tyr, or Trp; 
10 Xaa 32 is: Gly, Glu, Asp, or Lys; 

Xaa 25 is: Ala, Val, He, or Leu; 

Xa.e-2i is: Glu, He, or Ala; 

Xaa^c is; Ala or Glu 

Xaa 33 is; Val, or lie; and 
15 Xaa 37 is: Gly, His, Mi 2 , 02: is absent. 

70. The heterologous fusion protein of Claim 69 wherein the 
second polypeptide is human albumin. 

20 71, The heterologous fusion protein of Claim 70 wherein the 
second polypeptide has the sequence of SEQ ID MO: 3 4.. 

72 . The heterologous fusion protein of Claim 59 wherein the 
second po3.ypept.ide is an N- terminal fragment of albumin. 

25 

73 . The heterologous fusion protein of any one of Claims 
27, 28, or 29 wherein the GLP-1 compound comprises the 
sequence of formula. XX [SEQ ID NO; 12] 

3 0 Xaa^-Xaas-Glu-Gly-Thr-Xaais-Thr ■■ Ser-Asp-Xaa ss ~Ser~- 

Xaa 18 -Xaa :J .9~Xaa2a~Glu"Xaaa 2 -Gln-Ala-Xaa 3s -Lys-Xaa?.r- 
Phe-Ile-.Xa.a30— Trp-Leu-Xaaaa-Lys-Gly -Arg»Xaa 37 
Formula IX {SEQ ID ISO: 12} 
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wherein : 
Xaa? is 



Xaa 8 is: 

Xaa^a is : 

Xaais is: 

Xaaig is; 

Xaa L9 is: 

Xaa^g is : 

Xaa;?2 is; 

Xaaaa is : 

Xaa 27 is: 

Xaaso is: 

Xaasj is: 



L-histidine, D-histidine, desandno-histidine, 2- 
ami no~his tidine , p-hydzxsiey-his tidine, 
homohistidine, a- f luorome thy 1 -hi s tidine , or ri- 
me thy 1 -hi stidine ; 

Ala, Gly, Val, leu, He, Ser, or Thr; 

Phe, Trp, or Tyr; 

Val, Trp, He, Lea, Phe, or Tyr; 

Ser, Trp, Tyr, Phe, Lys, He, Leu, Val; 

Tyr, Trp, or Phe; 

Leu, Phe, Tyr, or Trp; 

Gly, Glu, Asp, or Lys; 

Ala, Val, lie, or Leu; 

Glu, He, or Ala; 

Ala or Glu 

Val, or lie; and 



Xaa 



is: Gly, 



y< is :;bsen 



20 74. The heterologous fusion protein of Claim 73 wherein the 
second polypeptide is the Fc portion of an Ig selected front 
the group consisting of: IgGl, IgG2 , IgG3, IgG4, IgE, igA, 
IgB, or IgM. 



75 . The heterologous fusion protein of Claim 73 or 74 
wherein the second polypeptide is the Fc portion of an Ig 
selected from the group consisting of: IgGl, lgG2, IgG3 , and 
IgG4, 



30 



75. The heterologous fusion protein of Claim 7 5 wherein the 
second polypeptide is the Fc portion of an IgGl 
immunoglobulin . 
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77, The heterologous fusion protein of Claim 75 wherein 
the second polypeptide is the Fc portion of an lgG4 
imraunoglobi.il. in . 



78. The heterologous fusion protein of Claims 73 through 
78 wherein the igG is human. 

79. The heterologous fusion protein of Claim 75 wherein 
the second poLypep do ha i< < ; j - SEQ D NO: 
32. 



80. The heterologous fusion protein of any one of Claims 
69 through 79 wherein the GLP-1 compound is selected 
from the group consisting of Giy 8 -GLP-1 (7 -37} , Val 8 - 

15 Tyr 12 ~GLP-l(7-37), Val 8 -Tyr i2 -GLP~1 {7-36} , Val s -Trp 12 ~ 

GLP-K7-37), Val 8 -Leu i5 -GLP-l{7-37) , Val 8 -Val 2S -GLP-l {7- 
37 5 , Val s ~Tyr lfi ~GLF- 1 (7-37) , Gly 8 -Glu 22 -GLP-1 {7-37} , 
Val 8 -Glu 22 -GLP-l<7-3?) ,Val 8 -Leu 2S -GLP-1 (7-37) , Val 8 - 
Tyr 12 -Tyr ls ~GLP-l (7-37 ) , Wal 8 -Trp ia -61u 22 -QLP-l (7-37 ) , 

20 Val 8 ~Tyr 12 -Giu 22 -GLP-l {7-37 } , Val s -Tyr i6 -Phe" iS -GLP-l {7- 

37), Val 8 ~Tyr X5 ~Glu 32 -GLP-l {7-37) , Val 8 ~Trp 16 ~Glu 2s -GLP~ 
1 (7-37) , Val 8 ~Lau"-Giu 22 -GLP-l (7-37 } , . Val 3 - Jle is -Glu 22 - 
GLF-l (7,-37) , Val 8 -Phe w -Glu 23 -GLP-1 (7-3? } , Val 8 -Trp ia - 
Glu 22 -GLP-1 {7-37} , Val 8 -Tyr 18 -Glu 22 -GLP-1 {7-37} , Val 8 - 

25 ?he i8 -Glu 22 -GLP-l {7-37} , Val 8 -Xle 18 ~Glu 22 -GLP-l {7-37} , 

Val 8 -Lys 18 -Glu 22 -GLP-H7-37) , Val 8 -Trp 19 -Glu 22 -GLP-1 ( 7~- 
37) .. Val 8 -Phe xs -Glu 22 -GLP-1 (7-37) , Val 8 -Phe 20 -Glu 22 -GLP- 
1(7-37) , Val 8 -Glu 22 -Leu 2s -GLP-l<7-37) , Val s -Glu 22 -Ile 25 - 
GLP-1 (7-37) , Val 8 -Glu"-Val 35 -GLP-1 (7-37 ) , ¥al 3 -Glu 22 - 

30 Ile 27 -GLP~l{7-37) , Val 8 -Glu 22 -Ala 2? ~GLP~l (7-37) , Val 8 - 

Glu 22 -Ile 33 -GLP~1 (7-37) , Val s -Asp 9 -Ile :ll ~Tyr^-Glii 22 -GLP- 
1(7-3?) , Val 8 -Tyr 18 -Trp 19 -Glu 22 -GLP-1 (7-37) , Val 8 -Trp 26 ~ 
G.lu 22 -Val 2S -Ile 33 ~GLP~l (7-37) , Val 8 ~Trp 16 -Glu 22 -Ile 33 ~GLP~ 
1(7-37} , Val 8 -Glu 22 -Val 25 -Ile 33 -Gr,F-l{7-37) , Val 8 -Trp 16 - 
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Gl^-Val^- GLP-K7-37), and Val -Cys" -Lys~ fc ~GLP~l f 7- 
37) . 
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10 






IS 






.t re Lys 


Ser Cy s 




Lys 


.tiii. riis inr 


Cys Pro 


Pro 


Cys 








20 






25 






30 


Pro 




rrTO GlU 


Lys Gly 


Gly 


Pro 


Ser Val Phe 


Leu Phe 


Pro 


Pro 














40 






4 5 




ro 


Lys Asp 


*I iir Lys 




lie 


Ser Arg Thr 


Pro Glu 


Val 


Thr 








50 






55 






60 


s 


ai 




Asp Val 




w 


Glu Asp Pro 


- 

Glu val 


Lys 


Pee 








if 5 






70 






75 


^sn 


Frp 


Tyr Val 


Asp Gly 


Val 


Glu 


Val His Asn 


Ala Lys 


Thr 


Lys 




















90 


T.O 


Arg 




Gin Tyr 


Asn 


Ser 


Thr Tyr Arg 


Val Val 


Ser 


Val 














100 






105 


Leu 


r 


val Leu 


Lis Gin 


Asp 


Trp 


Leu Asn Gly 


Lys Glu 


Tyr 


Lys 














115 






120 


Cys 


ys 


a oer 


As ii I-'VS 


a 




Pro Al a Pro 


lie Glu 


Lys 


Thr 








125 






130 






135 


lie 


Ser 


Lys Ala 


Lys Gly 


Gin 


Pro 


Arg Glu Pro 


Gin Val 


Tyr 


Thr 








140 






145 






150 


Leu 


Pro 


Pro Ser 


Arg Glu 


rlu 


v-t 


Tnr Lys Asn 


Gin Va3 


Ser 










155 






160 






165 


Thr 


Cys 


Leu Val 


Lys Gly 


Phe 


Tyr 


Pro Ser Asp 


lie Ala 


Val 


Glu 








170 






175 






180 


Trp 


Glu 


Ser Asn 


Gly Gin 


Pro 


Glu 


Asn Asn Tyr 


Lys Thr 


Thr 


Pro 








185 






190 






195 


Pro 


Val 


Leu Asp 


Ser Asp 


Gly 


Ser 


Phe Phe Leu 


Tyr Ser 


Lys 


Leu 








2 00 






2 05 






210 


Thr 


Val 


Asp Lys 


Ser Arg 


Trp 


Gin 


Gin Gly Asn 


Val Phe 


Ser 


Cys 








215 






220 






225 


Ser 


Val 


Met His 


Glu Ala 


Leu 


His 


Asn His Tyr 


Thr Gin 


Lys 


Ser 








230 












Leu 


Ser 


Leu Ser 


Pro Gly 


Lys 




[SEQ ID MO: 


: 32] 
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5 




10 




15 








Asp 


Ala 


His 


Lys Ser Glu Val Ala 


His 


Arg 


Phe Lys Asp Leu 


Gly 
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GlU 


Asn 




20 




2S 






30 






35 




Phe 


Lys 


Ala 


Leu Val Leu He Ala 




Ala 


Gin Tyr Leu Gin 


Gin 


Cys 


Pro 


Phe 








40 


45 




50 










Glu 


ASP 




Val Lys Lei; Val Aran 




Val 


Thr Glu Phe Ala 


Lys 


Thr 


Cys 


Val 


55 






60 






65 




70 






Ala 


ASp 


Gin 


Ser Ala Glu Asa Cys 


Asp 


Lys 


Ser Leu His Thr 


Leu 


Phe 


Gly 


Asp 






75 


80 






85 








30 


!.,ys 


Leu 


Cys 


Thr Val Ala Thr Lea 


Arg 


Glu 


Thr Tyr Gly Glu 


Met 


Ala 


Asp 


Cys 








SS 




100 




10S 








Cys 


Ala 


Lys 


Gin Glu Pro Glu Arg 


Asn 


Glu 


Cys Phe Leu Gin 


His 


Lys 


Asp 


Asp 




110 




3.15 






120 






125 




Asxs 


Pro 


Asa 


Leu Pro Arg Lea Val 


Arg 


Pro 


Glu Val Asp Val 


Met 


Cys 


Thr 


Ala 








13 0 


135 




14 0 










Phe 


Hi s 


Asp 


Asn Glu Glu Thr Phe 


Leu 


Lys 


Lys Tyr Leu Tyr 


Glu 


He 


Al a 


Arg 


145 












155 










Arg 


His 


Pro 


Tyr Phe Tyr Ala Pro 


Glu 


Leu 


lieu Phe Phe Ala 


Lys 


Arg 


Tyr 


Lys 






165 








3.75 








ISO 


Ala 


Ala 


Phe 


Thr Glu Cys Cys Gin 


Ala 


Ala 


Asp Lys Ala Ala 


Cys 


Leu 


Leu 


Pro 








185 




190 




195 








Lys 


Leu 


Asp 


Glu Leu Arg Asp Glu 


Gly 


Lys 


Ala Ser ser Ala 


Lys 


Gin 


Arg 


Leu 




200 




205 






210 






215 




Lys 


Cys 


Ala 


Ser Leu Gin Lys Phe 


Gly 


Glu 


Acg Ala Phe Lys 


Ala 


Trp 


Ala 


Val 








220 


225 




230 










Ala 


Arg 


Leu 


Ser Gin Arg Phe Pro 


Lys 


Ala 


Glu Phe Ala Glu 


Val 


Ser 


Ays 


Leu 


235 






240 






24 5 




250 






Val 


Thr 


Asp 


Leu Thr Lys Val His 


Thr 


Glu 


Cys Cys His Gly 


Asp 


Leu 


.Leu 


Glu 






3SS 


360 






265 








270 


Cys 


Ala 


Asp 


Asp Arg Ala Asp Leu 


Ala 


Lys 


Tyr He Cys Glu 


Asn 


Gin 


Asp 


Ser 








275 




280 




285 








He 


Ser 


Ser 


Lys Leu Lys Glu Cys 


Cys 


Glu 


Lys Pro Leu Leu 


Glu 


Lys 


Ser 


His 








295 






300 






305 




Cys 


He 


Ala 


Glu Val Glu Asn Asp 


Glu 


Met 


Pro Ala Asp Leu 


Pro 


Ser 


Leu 


Ala 








310 


315 




320 










Ala 


Asp 


Phe 


val Glu Ser Lys Asp 


Val 


Cys 


Lys Asn Tyr Ala 


Glu 


Ala 


Ays 


Asp 


325 






330 






335 




340 






val 


Phe 


Leu 


Gly Met Phe Leu Tyr 


Glu 


Tyr 


Ala Arg Arg His 


Pro 


Asp 


Tyr 


Ser 






345 


350 






3SS 








360 


Val 


Val 


Leu 


Leu Leu Arg Leu Ala 


Lys 


Thr 


Tyr Glu Thr Thr 


Leu 


Glu 


Lys 


Cys 








365 




370 




37S 








Cys 


Ala 


Ala 


Ala Asp Pro Lxa eAu 


'-h'S 


Tyr 


Ala Lys Val Phe 




Glu 


Phe 


Lys 




330 




385 






3.90 






3 95 




Pro 


Leo 


Val 


Glu Glu Pro Gin Asn 


Leu. 


lie 


Lys Gin Asn Cys 


Glu 


Leu 


Phe 


Glu 








400 


405 




410 










Asn. 


Leu 


sly 


Glu Tyr Lys Phe Gin 


Asn 


Ala 


Leu Leu Val Arg 


Tyr 


Thr 


Lys 


Lys 


415 






420 






425 




430 






Val 


Pro 


Gin 


Val Ser Thr Pro Thr 


Leu 


Val 


Glu Val Ser Arg 




Leu 


Gly 


Lys 






435 


440 






445 










Val 


Gly 






Pro 


Glu 


Ala Lys Arg Met 






Ala 










455 




460 












Asp 


Tyr 


Leu 


Ser Val Val Leu Asn 


Gin 




Cys Val Leu His 


Glu 




Thr 


Pro 




47 0 




475 






480 






485 




Val 


Ser 


Asp 


Arg Val Tnr Lys Cys 


Cys 


Thr 


Glu Ser Leu Val 


Asn 


Arg 


Arg 


Pro 
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490 






495 


500 




Cys 


Phe 


Ser 


Ala Leu 


Glu 


Val Asp 


Glu 


Thr Tyr Val Pro Lys Glu 


Phe Asa Ala 


505 








510 






SIS 


520 


Glu 


Thr 


Phe 


Thr Phe 


His 


Ala Asp 


Xls 


Cys Thr Leu Ser Glu Lys 


Glu Arg Gin 






525 






530 




535 


540 


lie Lys 


Lys 


Gin :'hr 


Ala 


lieu Val 




Leu Val Lys His Lys Pro 


Lys 7ila Thr 
















550 555 




IjYS 


Glu 


Gin 


Leu Lys 


Ala 


Val Met 


Asp 


Asp Phe Ala Ala Phe Val 


Glu Lys Cys 




5S0 








565 




570 


575 


Cys 


Lys 


Ala 


Asp Asp 


Lys 


Glu Thr 


Cys 


Phe Ala Glu Glu Gly Lys 


Lys x.eu Val 








530 












Ala 


Ala 


Ser 


Gin Ala 


Ala 


Leu Gly 


Leu 
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250--* 
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98 »> 
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3(1 — * 







B. 

KDa 12 3 4 5 6 7 




? V ^jnir i^.m % ., J,. 

us ► 
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Fig. 7 
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Fig. 8 
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Fig. 9 
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A. 
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B. 
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A. 
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Fig , 12 



1 

GAGCCCAAATCTTGTGACAAA&CTCACACATGCCCACCGTGCCCA.GCACC 
50 

TGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAAAACCCAAGG 
100 

ACACCGTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGAC 
150 

GTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGT 
200 

GGAGGTGCATAATGCCAA<3ACAAAGCCGCGGGAGGAGCAGTACAACAGCA 
250 

cgtaccgtgtggtcagcgtcctcacc:gtcctgcaccaggactggctgaat 

300 

ggcaaggagtacaagtgcaaggtctccaacaaagccctcccagcccccai 1 

350 

cgagaaaaccatctgcaaagccaaagggcaggcccgagaaccacaggtgt 

4 00 

acaccctgcccccatcccgggaggagatgaccaagaaccaggtcagcctg 

450 

acctgcctggtcaaaggcttctatcccagcgacatcgccgtggagtggga 

500 

gagcaatgggcagccggagaacaactacaagaccacgcctcccgtgctgg 

550 

ACTCCGACGGCTCCTTCTTCCTC'XATAGCAAGCTCACCGTGGACAAGAGC 
600 

AGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGi'GA'I?G(:.'A'r<?tAGGCTCT 
650 

GCACAACCACTA^CGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATGAT 
70 0 

AGT [SEQ ID FO: 33 J 
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Fig. 13 



1 

1 ' iCOC CAAG. JT' V - ? TTGC1X1ATCCGTTTAAAGATTTGGGAGAAGA 
50 

AAM'TTCAAAGCCTTGGTGTTGATTGCCTTTGCTCAGTATCTTC&GCAGT 
100 

GTCCATTTGAii.C3ATCATGTAAAATTAGTGAATGAAGTAACTGAATTTGCA 
15 0 * 

AAAACATGTGTTGCTGA'!'GAGTGAGCTGAAAATTGTGACAAATC?iCTTCA 
200 

TACCCTTTTTGGAGACAAATTATGCACAGTTGCAACTCTTCGTGAAACGT 
250 

ATGGTGAAATGGCTGACTGCiGTGCAAAACAAGAACCTGAGAGAAATGAA 
300 

TGCTTCTTGCAACACAAA.GATGACAACCCA.AACCTCCGCCGATTGGTGAG 
350 

ACCAGAGGTTGATGTGATGTGCACTGCTTTTCATGACAATGAAGAGACAT 
400 

TTTTGAAAAAATACTTATATGAAATTGCCAGAAGACATCCTTACTTTTAT 
450 

GCCCCGGAACTCCTTTTCTX'TGCTAAAAGGTATAAAGCTGCTTTTACAGA 
500 ' 

ATGTTGCCAAGCTGCTGATAAAGCTGCCTGCCrGTTGCCAAAGCTCGATG 
550 

AACTTCGGGATGAAGGGAAGGCTTCGTCTGCCAAACAGAGACTCAAGTGT 
600 

GCCAGTCTCCAAAAATTTGGAGAAAGAGCTTTCAAAGCATGGGCAGTAGC 
650 

XCGGCTGAGCGAGAGATTTCCCGAAGICTGAGTTTGCAGAAGTTXCCAAGT 
700 

TAGTGACAGATCTTACCAAAGTCCACACGGAATGCTGCCATGGAGATCTG 
750 

CTVGAATGG "I = l\ l VTTGG_AAG AXA1 G7GGG AAAA 

800 

TGAaGATTCGA-rGI : GCAGTAAACTGAAGGAATGCTGTGAAAAACCTCTGT 
850 

TGGAAAAATCCCL\CTGCATTGCCGAAGTGGAAAJkTC^\TGAGATGCCTGCT 
900 

GACTTGC TTC [ 1 3ATTTTGTTGAAAGTAAGG i 
350 

AA ACTATGC T'GAGGC AA AGGATGT C TT C C TG G GC ATGTT T TTGT ATG AAT 
1000 

ATGCAAGAAGGCArCCTGATTACTCTGTCGTGCTGC'TGCTGAGACTTQCC 
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Fig. 13 Continued 



1050 

AaGACATATGAAACOiCTCTAGAGAAGTGCTGTGCCGCTGCAGATCCTCA 
1100 

TGAATGCTATGCCAAAGTGTTCGATGAATTTAAACCTCTTGTGGAAGAGC 
1150 

CT'CAi^AAT'ri^VATCAAACAAAATTGTGAGCTTTTTGAGCAGCTTGGAGAG 
1200 

TACAAATTCCAGAJ T GCTAT 31 < ' tV WXAAG - i/ACCCCA 
12 SO 

AGTGTCAACTCCAACTCTTGTAGAGGTCTCAAGAAACCTAGGAAAAGTGG 
1300 

GCAGCAAATGTTG1AAACATCCTGAAGCAAAAAGAATGCCCTGTGCAGAA 
1350 

GACTATCTATCKGTGGTCCTGAACCAG^rTATGTGTGTTGCATGAGAAAAC 
1400 

GC(:AG'rAAGT ; GAC:i\CiA<Jl ! CACC^AATGCTGCACAGAATCCTTGGTGAACA 
1450 

GGCGACCATGCTTTTCAGCTCTG<3AAGTCGATGAAACATACGTTCCCAAA 
1500 

Gi\GTTT: : AATGCTGAAACATTCACCTTCCATGCAGATATATGCACACTTTC 
1550 

TGAGAAGGAGAGACIAAATCAAGAAACAAACTGCACTTGTTGAGCTCGTGA 
1600 

AJICJi.CAAGCCCAAGGCAACAAAAGAGCAACTGAAAGCTGTTATGGATGAt' 
1650 

TTCGCAGCTTTTGTAGAGAAGTGCTGCAAGGCTGACGATAAGGAGACCTG 
1700 

CTTTGCCGAGGAGGGTAAAAAACTTGTTGCTGCAAGTCAAGCTGCCTTAG 
1750 

GCTTATAATGAC [SEQ ID HO: 35] 
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X-13991. ST25.txt 
SEQUENCE LISTING 

<110> Eli Lilly and Company 

<120> GLP-1 FUSION PROTEINS 

<130> X-13991 

<150> US 60/251,954 
<151> 2000-06-12 

<160> 35 

<170> Patentin version 3.1 

<210> 1 

<211> 31 

<212> PRT 

<213> Homo sapiens 

<400> 1 

His Ala 61 u 61 y Thr Phe Thr ser Asp Val Ser ser Tyr Leu Glu Gly 
1 5 10 15 

Gin Ala Ala tys Glu Phe He Ala Trp Leu val lys 61 y Arg 61 y 
20 25 30 

<210> 2 
<211> 39 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> synthetic construct 
<220> 

<221> MIS C m F EATU R £ 

1 



PCI71) SOI/43165 

X-13991. ST2S.txt 



<222> C2)..(2) 

<223> xaa at position 2 is Ala, Gly, ser, Tbr, Leu, lie, val, 61 u s Asp 
or Lys; 



<22G> 

<221> SCTSC FEATURE 

<222> C3),,(3) 

<223> xaa at position 3 is Glu, Asp, or Lys; 



<220> 

<221> MISCFEATURE 
<222> 0),.(5) 

<223> Xaa at position 5 is Thr, Ala, Gly, ser. Leu, lie, val, Glu, Asp, 
or Lys; 



<220> 

<221> MISCFEATURE 
<222> C8) . . (8) 

<223> xaa at position 8 is Ser, Ala, Gly, Thr, Leu, lie, val, Glu, Asp, 
or Lys; 

<220> 

<Z21> MISCFEATURE 
<222> (10) . . (10) 

<223> xaa at position 10 is val, Ala, Gly, ser, Thr, Leu, lie, Tyr, Glu 
, Asp, or Lys; 



<220> 

<221> MISCFEATURE 
<222> (11) . . (11) 

<223> xaa at position 11 is Ser, Ala, Gly, Thr, teu, lie, val, Glu, Asp 
, or Lys; 



<220> 
<22I> 
<222> 



MISCFEATURE 
C12) . . (12) 
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a- .iio. TXT: 

<223> xaa at position 12 is Ser, Ala, Gly, Thr, Leu, lie, val , Glu, Asp 
, Lys, Trp or Tyr; 

<220> 

<221> ffiSCFFATURE 
<222> (13),. (13) 

<223> xaa at position 13 is Tyr, Phe, Trp, Glu, Asp, Gin, or Lys; 
<220> 

<22I> MISC_FEATURE 
<222> (14} . - (14) 

<223> Xaa at position 14 is Leu, Ala, Gly, ser, Thr, lie, val, Glu. Asp 
, Met, Lys, Trp or Tyr; 

<220> 

<221> MI SC_FEATURE 
<222> (IS) . . (.15) 

<223> Xaa at position 15 is Glu, Asp, or Lys; 
<220> 

<221> MISCJFEATURE 
<222> (16) . . (16) 

<223> xaa at position 16 is Gly, Ala, Ser, Thr, Leu, He, val, Glu, Asp 
, Trp or Lys; 

<220> 

<221> MISC...FEATURE 
<222> (17) , , (17) 

<223> Xaa at position 17 is Gin, Asn, At-q, Glu, Asp, or Lys; 

<220> 

<221> HISC-FEATURE 
<222> (18) . . (18) 

3 
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<223> xaa at position 18 is Ala, Gly, s<at% Thr, Leu, He, Val , Arg, Glu 
, Asp, or tys; 

<220> 

<221> MI5C_FEATURE 

<222> (13).. (19) 

<223> Xaa at position 19 is Ala, Gly, Ser s Thr, Leu, lie, Val , Glu, Asp 
, or Lys; 

<220> 

<221> MISCFEATURE 

<222> (20) . . (20) 

<223> Xaa at position 20 is Lys, Arg. Gin, Glu, Asp. or His; 
<220> 

<221> MISCLFEATURE 

<222> ill) . , (21) 

<2?3> Xaa at position 21 is Leu, Glu, Asp, or Lys; 
<220> 

<221> MI SC„ FEATURE 

<222> (24)., (24) 

<223> Xaa at position 24 is Ala, Gly, Ser, Thr, Leu, lie, Val, Glu, Asp 
, or Lys; 

<220> 

<221> MISC_F EATURE 

<222> (25)., (25) 

<223> Xaa at position 25 is Trp, Phe, Tyr, Glu, Asp, or Lys; 

<220> 

<22I> MISCJr EATURE 

<222> (26) . . (26) 

4 
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<223> xaa at position 26 is Leu, Gly, Ala, Ser, Thr, lie, Val , Glu, Asp 
, or Lys; 

<220> 

<221> MISCFEATURE 
<222> (27).. (27) 

<223> xaa at position 27 is val , Gly, Ala, Ser, Thr, Leu, lie, Glu, Asp 
, or Lys ; 

<220> 

<221> MISCFEATURE 
<222> (28) . . (28) 

<223> Xaa at position 2S is Asn, Lys, Arg, Glti, Asp, or His; 
<220> 

<221> MISCFEATURE 
<222> (29) . . (29) 

<223> Xaa at position 29 is Gly, Ala, Ser, Thr, Leu, lie, val, Glu, Asp 
, or Lys; 

<220> 

<221> MISCFEATURE 
<222> (30) . . (30) 

<223> xaa at position 30 is Gly, Arg, Lys, Glu, Asp, or His; 
<22Q> 

<223> MISCFEATURE 
<222> (31) . , (31) 

<223> Xaa at position 31 is Pro, Gly, Ala, Ser, Thr, Leu, Tie, Val, Glu 
, Asp, or Lys, or is deleted; 

<220> 

<221> MISCFEATURE 

5 
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<222> (32).. (32) 

<223> Xaa at position 32 is Ser, Arg, Lys, Glu, Asp, or His, or is dale 
ted ; 



<22G> 

<221> MISCwFEATURE 
<222> (33) . . (33) 

<223> Xaa at position 33 is Ser, Arg, Lys, Gin, Asp, or His, or is dele 
ted; 



<220> 

<221> MXS€_FEATURE 

<222> (34).. (34) 

<223> xaa at position 34 is Gly, Asp, Glu, or Lys, or is deleted; 



«22Q> 

<221> MXSCFEATURE 
<222> (35).. (35) 

<223> Xaa at position 35 is Ala, Phe, Trp, Tyr, Gfu, Asp, or Lys, or is 
deleted; 



<220> 

<221> MISC„FEATURE 

<222> (36).. (36) 

<223> xaa at position 36 is ser, Pro, Lys, Glu, or Asp, or is deleted; 



<220> 

<221> MISC_ FEATURE 

<222> (37) . . (37) 

<223> Xaa at. position 37 is Ser, Pro, Glu, Asp, or Lys, or is deleted; 



<220> 

<221> MI5C..FEATORE 



WO 02/46227 PCT/US01/43165 

X-13991.ST25.txt 

<222> (38).. Cm 

<223> xaa at position 38 is Gly, Pro, Glu, Asp, or Lys, or is deleted; 



<220> 

<221> ffiS C„F E ATU R E 
<222> (39)., (39) 

<223> xaa at position 39 is Ala, ser, val , Glut, Asp, or Lys, or is dele 
ted ; 



^400> 2 

His Xaa Xaa Gly Xaa Phe Thr Xaa Asp xaa Xaa Xaa Xaa Xaa Xaa Xaa 



Xaa Xaa Xaa Xaa Xaa Phe lie Xaa xaa xaa Xaa xaa xaa xaa Xaa xaa 
20 25 30 



<210> 3 

<211> 32 

<212> PRT 

<213> Artificial sequence 



<22Q> 

<223> synthetic construct 
<220> 

<221> M1SC..FEATURE 

<222> (1} . . CO 

<223> Xaa at position I is L-histidine, D-histidine. or is deleted. 



<220> 

<221> MIS C_ f E ATU R E 

<222> C2) . . C2) 

<223> Xaa at position 2 is Gly, Ala, val, Leu, He, Ser, or Thr; 



<220> 



7 
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<22I> MISCL.FEATURE 
<222> (3)..C3) 

<223> xaa at position 3 is Thr, Sep, Arg, Lys, Trp, Phe. Tyr, Glu, or H 
is; 

<220> 

<221> MISC_FEATURE 
<222> <S).-C5) 

<223> xaa at position S is Asp, Glu, Arg, Thr, Ala, Lys, or His; 
<220> 

<221> Ml SC„F£ATUR£ 
<222> (6).. (6) 

<223> xaa at position 6 is His, Trp, Phe, or Tyr; 
<220> 

<221> MISC„FEATURE 
<222> (10) . . (10) 

<223> xaa at position 10 is leu, ser, Thr, Trp, His, Phe, Asp, val, Tyr 
, 61 u, or Alia; 

<220> 

<221> MISC„FEATURE 
<222> (12),, (12) 

<223> xaa at position 12 is His, Pro, Asp, Glu, Arg, Ser, Ala, or Lys; 
<22G> 

<221> M I 5C_ FEATURE 
<222> C13)..C13) 

<223> xaa at position 13 is Gly, Asp, G'lu, Gin, Asn, Lys, Arg, or cys; 
<220> 

8 
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<221> MISC„FEATURE 
<222> (17)., (17) 

<223> xaa at position 17 is His, Asp, Lys, Glu, Gin, or Arg; 



<220> 

<221> MIS C_F EAT U RE 

<222> (18) . . (18) 

<223> xaa at position IS is Glu, Arg, Ala, or Lys; 



<220> 

<221> M T.SCL.F EATti R E 

<222> (20) . . (20) 

<223> Xaa at position 20 is Trp, Tyr, phe, Asp, Lys, Glu, or His; 



<220> 

<221> MXSCFEATURE 

<222> (21) . . (21) 

<223> xaa at position 21 is Ala, Glu, His, Phe, Tyr, Trp, Arg, or Lys; 



<220> 

<221> MISCJFEATURE 

<222> (24) , . (24) 

<223> xaa at position 24 is Ala, Glu, Asp, Ser, or His; 



<220> 

<221> MISC_FEATURE 

<222> (25)., (25) 

<223> Xaa at. position 25 is Asp, Glu, ser, Thr, Arg, Trp, or Lys; 



<22G> 

<221> MISCFEATURE 
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<222> (27).. (27) 

<223> xaa at position 27 is Asp, Arg, val , Lys, Ala, Gly, or Glu; 
<?20> 

<221> MISC„FEATURE 
<222> (28) . . (28) 

<223> Xaa at position 28 is Glu, Lys, or Asp; 

<220> 

<223> MXSCFEATURE 
<222> (29) -.(29) 

<223> xaa at position 29 is Thr, Ser, Lys, Arg, Trp, Tyr, Phe, Asp, Gly 
, pro, His, or Glu; 

<220> 

<221> MISCFEATURE 
<222> (30) . . (30) 

<223> Xaa at position 30 is Thr, ser, Asp, Trp, Tyr, Phe, Arg, Glu, or 
Hi s ; 

<220> 

<221> MISQ.FEATURE 
<222> (31) . . (31) 

<223> Xaa at position 31 is Lys, Arg, Thr, Ser, Glu, Asp, Trp, Tyr, Phe 
, His, Gly, or is deleted, 

<220> 

<221> MISCFEATURE 
<22Z> (32),, (32) 

<223> xaa at position 31 is Pro or is deleted, 

<400> 3 

xaa xaa xaa Gly xaa xaa Thr Ser Asp xaa ser xaa xaa Leu Glu Gly 
IS 10 15 



Xaa Xaa Ala Xaa Xaa Phe lie Xaa Xaa Leu xaa Xaa xaa Xaa Xaa xaa 
20 25 30 

10 
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X- 13991.ST2S.txt 

<210> 4 

<211> 32 

<212> PRT 

<:213> Artificial Sequence 
<220> 

<223> synthetic construct 
<220> 

<221> MXSC„FEATUR£ 

<222> CD . - CD 

<223> xaa at position 1 is l-fnstidine, D-histidine, or is deleted, 
<220> 

<221> MXSCFEATURE 

<222> (2) . . (2) 

<223> Xaa at position 2 is Gly, Ala, val , Leu, He, Set% or Thr; 
<220> 

<221> MISC_FEATUR£ 

<222> C5),.C5) 

<223> Xaa at position 5 is Asp, Glu, Arg, Thr, Ala, Lys, or His; 
<220> 

<221> MXSCFEATURE 

<222> C6) . . (6) 

<223> xaa at position 6 is His, rrp, pfie, or Tyr; 

<220> 

<221> MISC..FEATURE 

<222> (10} , . (10) 

<223> xaa at position 10 is Leu, Ser, Thr, Trp, His, Phe, Asp, Va'i, Glu 
, or Ala; 

11 
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<220> 

<221> MISCFEATURE 
<22Z> (16) . . (16) 

<223> Xaa at position 16 is Gly, Asp, Glu, Gin, Asn, lys, Arg, or Cys; 



<220> 

<221> MISCFEATURE 

«222> (17) , , (17) 

<223> Xaa at position 17 is His, Asp, Lys, Glu, or Gin; 
<220> 

<221> MISCLFEATURE 

<222> (IS) , . (18) 

<223> Xaa at position 18 is Glu, His, Ala, or Lys; 
<220> 

<221> MISC_FEATUR£ 

<222> (19) . « (19) 

<223> Xaa at position 19 is Asp, Lys, Glu, or His; 
<220> 

<221> MISCLFEATURE 

<222> (21) . . (21) 

<223> xaa at position 21 is Ala, Glu, His, Phe, Tyr, Trp, Arg, or lys; 
<220>- 

<221> MISC.FEATURE 

<222> (24) , . (24) 

<223> xaa at position 24 is Ala, Glu, Asp, Ser, or His; 



12 
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<220> 

<221> MISCFEATURE 
<222> (27)., (27) 

<223> Xaa at position 27 is Asp, Arg, val , Lys, Ala, Gly, or Glu; 
<220> 

<221> MISCFEATURE 
<222> (28) . . (28) 

<223> Xaa at position 28 is Glu, Lys, or Asp; 
<220> 

<221> MISCFEATURE 
<222> (29) . . (29) 

<223> Xaa at position 29 is Thr, Ser, Lys, Arg, Trp, Tyr, phe, ASp, Gly 
, Pro, His, or Glu; 

<220> 

<221> MISCFEATURE 
<222> (30) . . (30) 

<223> Xaa at position 30 is Arg, Glu, or His; 
<220> 

<221> MISCFEATURE 
<222> (31) . , (31) 

<223> Xaa at position 31 is lys, Arg, Thr, ser, Glu, Asp, Trp, Tyr, phe 
, His, Gly, or is deleted. 

<220> 

<221> MISCFEATURE 
<222> (32).. (32) 

<223> Xaa at position 32 is Pro, or is deleted. 

<400> 4 

Xaa Xaa Glu Glv Xaa Xaa Thr Ser Asp Xaa Ser Ser Tyr Leu Glu Xaa 
1 5 10 15 

13 
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PCI71) SOI/43165 



xaa xaa Xaa Lys xaa Phe lie. xaa Trp Leu xaa Xaa xaa xaa xaa xaa 
20 25 30 

<210> 5 

<211> 32 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<220> 

<221> MI SC., FEATURE 

<222> (1) . . (!) 

<223> Xaa at position 1 is t-histidioe, D-histidine, or is deleted. 
<220> 

<221> MI5CFEATURE 

<222> (2)., (2) 

<223> Xaa at position 2 is Gly, Ala, Val, Leu, lie, ser, Met, or Thr; 
<220> 

<221> MISC. FEATURE 

<222> (6) , . (6) 

<223> Xaa at position 6 is His, Trp, Phe, or Tyr; 

<2?.0> 

<221> MI SC...FEATURE 

<222> (10) . . (10) 

<223> xaa at position 10 is Leu, ser, Thr, Trp, His, Phe, Asp, Val, Glu 
, or Ala; 

<220> 

<221> MI SC_. F EATURE 

<222> (16}.. (16) 

14 
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<223> Xaa at position 16 is Gly, Asp, Glu, Gin, Asn, Lys . Arg, or cys; 
<220> 

<221> MISC_5=FATURE 

<222> (17).. (17) 

<223> Xaa at position 17 is His, Asp, Lys, 61 u, or Gin; 
<22Q> 

<221> MISC_i=EATUR£ 

<?.?.?> (20) . . (20) 

<223> Xaa at position 20 is Asp, Lys, Glu, or His; 
<22G> 

<221> MI SC_F £ ATU R E 

<222> (24).. (24) 

<223> Xaa at position 24 is Ala, Glu, Asp, ser, or His; 
<220> 

<221> MISC„FEATURE 

<222> (29) . , (29) 

<223> xaa at position 29 is Thr, ser, Lys, Arg, Trp. Tyr, Phe, Asp, Gly 
, Pro, His, or Glu; 

<220> 

<221> MISC_FEATURE 

<222> (31).. (31) 

<223> Xaa at position 31 is Lys, Arg, Thr, Ser, Glu, Asp, Trp, Tyr, Phe 
, His, Gly, or is deleted. 

<220> 

<221> MIS C_j- E A TUR E 

<222> (32).. (32) 

<223> Xaa at position 32 is Pro or is deleted. 



15 
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<400> 5 

xaa xaa Glu Gly Thr xaa Thr Ser Asp xaa ser ser Tyr Leu Glu xaa 
1 5 10 15 

xaa Ala Ala xaa Glu phe lie Xaa Trp teu val Lys xaa Arq xaa xaa 
20 25 50 

<210> 6 

<211> 32 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<220> 

<221> MISC_FEATUR£ 

<222> CO,. CD 

<223> Xaa at position 1 is L-hi sticline, D-histidine, or is deleted. 
<220> 

<223> MISC.FEATURE 

<222> (2) . . (2) 

<223> Xaa at position 2 is Gly, Ala, val , Leu, lie, Ser, or Thr; 
<220> 

<223> MI 5C_FE ATU R £ 

<222> (16) , . (16) 

<223> Xaa at position 16 is Gly, Asp, Glu, Gin, Asn, Lys, Arg, or Cys; 



<22Q> 

<221> MISC... FEATURE 

<222> (173-.C17) 

<223> Xaa at position 17 is His, Asp, Lys, Glu, or Gin; 

<220> 

<223> MISCFEATURE 

16 
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<222> {18) , . (18) 

<223> Xaa at position 18 is Ala, Glu, His, Phe, Tyr, Trp, Arg, or Lys; 
<220> 

<221> MISC-FEATURE 
<222> (24) . . (24) 

<223> Xaa at position 24 is Ala, Glu, Asp, Ser, or His; 
<220> 

<221> M I S C„ F E ATU R E 

<222> (31) . . (31) 

<223> Xaa at position 31 is Lys, Arg, Thr, Ser, Glu, Asp, Trp, Tyr, Phe 
, His, Gly, Gly-Pro, or is deleted. 

<220> 

<221> MISC_FEATURE 
<222> (32).. (32) 

<223> Xaa at position 32 is Pro or is deleted. 
<400> 6 

xaa xaa Glu Gly Thr Phe Thr ser Asp val Ser Ser Tyr teu Glu Xaa 
15 10 15 

xaa xaa Ala Lys Glu Phe lie Xaa Trp Leu val Lys Gly Arg xaa xaa 
20 25 30' 

<210> 7 
<211> 31 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> synthetic construct 
<220> 

<221> MISC„FEATURE 

17 
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<2Z2> CD . . (1) 

<223> Xaa at position 1 is L-histidine, D-histidine, or is deleted. 

<220> 

<221> MISCFEATURE 
<222> (2) . . (2) 

<223> Xaa at position 2 is Ala, Gly, Val, Thr, lie, and alpha-methyl -Al 
a; 

<220> 

<?.21> Mt5C„FEATORF. 
<222> CIS) . , (15) 

<223> Xaa at position 15 is Glu, Gin, Ala, Thr, Ser, Hind Gly; 
<220> 

<221> MISCFEATURE 
<222> (21) . . (21) 

<223> xaa at position 21 is Glu, Gin, Ala, Thr, ser, and Gly; 
<400> 7 

Xaa xaa Glu Gly Thr Phe Thr ser Asp val ser Ser Tyr Leu Xaa Gly 
1 5 10 15 

Gin Ala Ala Lys Xaa Phe lie Ala Trp Leu Val Lys Gly Arg Glv 
20 25 30 - 

<210> 8 
<21I> 30 
<212> PR?' 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<22Q> 

<221> MISCFEATURE 
<222> (19).. (19) 

IS 
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<223> xaa at position 19 is Lys or Arg; 

<22Q> 

<221> !v!OD„RES 

<222> (27),, (27} 

<223> ACETYLATION 

<220> 

<221> MISC„PEATURE 

<222> (30).. (30) 

<223> Xaa at position 30 is Gly; 

<22Q> 

<221> MOD„RE5 

<222> (30) , . (30) 

<223> AMIDATION 

<400> 8 

Ala Glu Gly Thr Phe Thr ser Asp va'j set- ser Tyr Leu Glu Gly Gin 
15 10 15 

Ala Ala xaa Glu phe He Ala Trp Leu val Lys Glv Arg xaa 
20 25 30 

<210> 9 
<211> 39 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 9 

His ser Asp Gly Thr Phe Thr Ser Asp Leu ser Lys Gin Met Glu Glu 
1 5 10 15 

Glu Ala val Arg Leu Phe lie Glu Trp Leu Lys Asn Gly Gly Pro Ser 



19 
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Ser Gly Ala Pro Pro Pro Ser 
33 

<210> 10 
<213> 39 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> synthetic construct 
<400> 10 

His Gly Glu Gly Thr phe Thr ser Asp Leu Ser Lys Gin Met Glu Glu 
15 10 15 

Glu Ala val Arg Leu phe lie Glu Trp Leu Lys Asn Glv Gly Pro Ser 
20 25 ' 30 

Ser Gly Ala Pro Pro Pro ser 
35 



<21Q> 11 

<211> 39 

<212> PRT 

<213> Artificial sequence 



<220> 

<223> synthetic construct 
<22Q> 

<221> ?v]ISC„FEATURE 

<222> CD --CD 

<223> xaa at position I is L-histidine, D-histicHne, or is deleted, 



<220> 

<221> MSC_F£ATURE 

<222> (2) . . C2> 

<223> xaa at position 2 is Gly, Ala, or val; 



<220> 



20 
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<221> MISC„FEATORE 
<222> CIO) , . (10) 

<223> xaa at position 10 is Leu or.val; 



<220> 

<221> MISC_FEATURE 

<222> (12) . , (12) 

<223> Xaa at position 12 is Lys or sen 



<220> 

<221> MISCFEATURE 

<222> (13).. Q3) 

<223> Xaa at position 13 is Gin or Tyr; 



<220> 

<221> MISCFEATURE 

<222> (14) » . (14) 

<223> xaa at position 14 is Met or Leu; 



<220> 

<221> MISCLFEATURE 

<222> (16} . . (16) 

<223> Xaa at position 16 is Glu or Gin; 



<220> 

<221> M ISC. FE ATI) R E 

<222> (17).. (17) 

<223> Xaa at position 17 is Glu or Gin; 



<220> 

<221> MISCL.FEATURE 

<222> (19)-- (19) 

<223> Xaa at position 19 is Val or Ala; 
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<220> 

<221> MXSCFEATURE 

<222> (20).. (20) 

<223> xaa at position 20 is Arg or Lys; 
<220> 

<221> MIS COFEATURE 

<222> (21) . . (21) 

<223> xaa at position 21 is Leu or Glu; 
<220> 

<221> MISC_FEATURE 

<222> (24) . . (24) 

<223> Xaa at position 24 is Glu or Ala; 
<220> 

<221> MISCFEATURE 

<222> (27). .(27) 

<22'i> Xaa at position 27 is Val or Lys; 
<220> 

<221> MISC....FEATURE 

<222> (28).. (28) 

<223> Xaa at position 28 is Asn or Lys: 
<220> 

<22I> MISC_FEATURE 

<222> (30) , , (30) 

<223> Xaa at position 30 is Gly or Arg; and 

<220> 

<221> MISC....FEATURE 

<222> C31}..(31} 

<223> xaa at position 31 is Gly, or Pro; 

22 
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<220> 

<221> MISCFEATU RE 

<222> (32). .(32) 

<223> Xaa at position 32 is Ser, or is absent. 

<220> 

<221> MISC.FEATURE 

<222> (33).. (33) 

<223> Xaa at position 33 is Ser, or is absent. 
<220> 

<221> MISCFEATURE 

<222> C34) . . (34) 

<223> Xaa at position 34 is Gly, or is absent. 
<220> 

<221> MISC„FEATURE 

<222> C35)..(35) 

<223> Xaa at position 35 is Ala, or is absent. 
<220> 

<221> MISC_FEATURE 

<222> (36) . , (36) 

<223> Xaa at position 36 is Pro, or is absent. 

<220> 

<221> MISCFEATURE 

<222> (37),. (37) 

<223> Xaa at position 3? is Pro, or is absent, 
<220> 

<221> M 1 SC„F E ATUR E 

<222> (38).. (38) 

23 
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<223> xaa at position 38 is fro, or is absent. 

<22Q> 

<221> MISCFEATURE 

<222> (39).. (39} 

<2?3> Xaa at position 39 is Pro, or is absent, 
<220> 

<221> MISCL.FEATURE 

<222> (39) . . (39) 

<223> Xaa at position 39 is Ser, or is absent. 

<400> 11 

xaa Xaa Glu Glu Thr Phe Thr ser Asp xaa Ser xaa xaa xaa Glu Xaa 
1 5 10 15 

Xaa Ala xaa Xaa xaa Phe lie xaa Trp Leu xaa xaa 61 y xaa xaa xaa 
20 25 30 

xaa Xaa Xaa xaa xaa xaa xaa 
35 

<210> 12 
<211> 31 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<220> 

<221> MISCLFEATURE 
<222> (1) . . CD 

<223> Xaa at position 1 is L-histidine, D-histidine, or is deleted, 

<220> 

<221> MISC_FEATURE 
<222> (2) . . (2) 

24 
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<223> xaa at position 2 is Ala, Gly, Va'l, Leu, He, Ser, or Thr; 
<220> 

<221> M I S C_ F E ATUR E 

<222> (6},. (6) 

<223> Xaa at position 6 is Phe, Trp. or Tyr; 
<220> 

<221> MISC.FEATURE 

<222> (10} . . (10) 

<223> xaa at position 10 is Va1, Trp, He, Leu. Phe, or Tyr; 
<220> 

<221> MISCFEATURE 

<222> (12) . . (12) 

<223> Xaa at position 12 is Ser, Trp, Tyr, Phe, Lys, He, Leu, Val; 
<220> 

<221> MI SQJF EATUR E 

<222> (13).. (B) 

<223> Xaa at position 13 is Tyr, Trp, or Phe; 
<220> 

<221> MI SC„F EATU R E 

<222> (14). .(14) 

<223> Xaa at position 14 is Leu, Phe, Tyr, or Trp; 

<220> 

<221> MXSCFEATURE 

<?.??> (16) . . (16) 

<223> xaa at position 16 is Gly, q]u, Asp, or Lys; 

<220> 

<221> MISC...FEATURE 

25 
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<22?> (19).. (19) 

<223> Xaa at position 19 is Ala r Va"i f lie, or leu; 



<22Q> 

<221> MXSC-.FEAXURE 

<222> (21) . . (21) 

<223> xaa at position 21 is Glu, lie, or Ala; 



<220> 

<221> MISC..FEATURE 

<222> (24} . . (24) 

<223> Xaa at position 24 is Ala or Glu; 



<220> 

<221> MISC„FEATURE 

<222> (27).. (27) 

<223> Xaa at position 27 is Val or He; and 



<220> 

<221> MOD„R£S 

<222> (30).. (30) 

<223> AMIDATTQN 



<220> 

<221> MISCFEATURE 

<222> (31).. (31) 

<223> Xaa at position 31 is Qly, His or is absent. 



<400> 12 

Xaa xaa Glu Gty Thr Xaa Thr Ser Asp xaa Ser xaa Xaa Xaa Glu Xaa 
15 10 IS 

Gin Ala xaa Lys xaa Phe lie Xaa Trp Leu xaa Lys Gly Arg xaa 
20 25 30 

<210> 13 
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<211> 616 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 13 

His Val Glu 61 y Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly 
1 5 10 15 

Gin Ala Ala Lys Glu Phe lie Ala Trp Leu Val Lys Gly Arg Gly Asp 
20 • 25 30 

Ala His Lys ser Glu Val Ala His Arg Phe Lys Asp Leu Gly Glu Glu 
35 40 45 

Asn Phe Lys Ala Leu Val Leu lie Ala Phe Ala Gin Tyr Leu Gin Gin . 
50 55 60 

Cys Pro Phe Glu Asp His val Lys Leu Val Asn Glu val Thr Glu Phe 
65 70 75 80 

Ala Lys Thr Cys val Ala Asp clu ser Ala Glu Asn Cys Asp tys Ser 
85 90 95 

Leu His Thr Leu Phe Gly Asp Lys Leu cys Thr val Ala Thr Leu Arg 
100 105 110 

Glu Thr Tyr Gly Glu Met Ala Asp Cys Cys Ala Lys Gin Glu Pro Glu 
115 120 * 125 

Arg Asn Glu Cys Phe Leu Gin His Lys Asp Asp Asn Pro ash Leu Pro 
" 130 ' 135 140 

Arg Leu Val Arg pro Glu val Asp val Met Cys Thr Ala Phe His Asp 
145 150 155 160 

Asn Glu Glu Thr Phe Leu Lys Lys Tyr Leu Tyr Glu lie Ala Arg Arg 
165 170 175 

His Pro Tyr Phe Tyr Ala Pro Glu Leu Leu Phe Phe Ala Lys Arg Tyr 
180 185 190 

Lys Ala Ala Phe Thr Glu Cys cys Gin Ala Ala Asp Lys Ala Ala cys 
195 200 205 

Leu Leu Pro Lys Leu Asp Glu Leu Arg Asp Glu Gly Lys Ala Ser ser 
210 215 220 

11 
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Ala Lys Gin Arg Leu Lys cys Ala Sep Leu Gin Lys Phe Gly Glu Arq 
225 230 235 240 

Ala Phe Lys Ala Trp Ala Val Ala Arg Leu Ser Gin Arc Phe Pro Lys 
245 250 " 255 

Ala Glu Phe Ala Glu val Ser Lys Leu val Thr Asp Leu Thr Lys Val 
260 265 270 

His Thr Glu Cys Cys His Gly Asp Leu Leu Glu Cys Ala Asp Asp Arg 
275 280 285 

Ala Asp Leu Ala Lys Tyr lie cys Glu Asn Gin Asp Ser lie Ser ser 
290 295 300 

Lys Leu Lys Glu Cys Cys Glu Lys Pro Leu Leu Glu Lys Ser .Mis Cys 
305 310 315 ' 320 

lie Ala Glu Val Glu Asn Asp Glu Met Pro Ala Asp Leu pro ser Leu 
325 330 335 

Ala Ala Asp Phe Val Glu Ser Lys Asp val cys Lys Asn Tyr Ala Glu 
MO 345 350 

Ala Lys Asp val Phe Leu Gly Met Phe Leu Tyr Glu Tyr Ala Arg Arg 
355 360 365 

His Pro Asp Tyr Ser Val Val Leu Leu Leu Arg Leu Ala Lys Thr Tyr 
370 375 380 

Glu Thr Thr Leu Glu Lys Cys Cys Ala Ala Ala Asp Pro His Glu cys 
385 390 395 400 

ryr Ala Lys val Phe Asp Glu Phe Lys Pro Leu val Glu Glu Pro Gin 
405 ' 410 415 

Asn Leu lie Lys Gin Asn Cys Glu Leu Phe Glu Gin Leu Gly Glu Tyr 
420 425 430 

Lys Phe Gin Asn Ala Leu Leu Val Arg Tyr Thr Lys Lys val Pro Gin 
435 440 445 

val ser Thr pro Thr Leu Val Glu val Ser Ara Asn Leu Gly Lys val 
450 455 460 

Gly ser Lys cys Cys Lys His Pro Glu Ala Lys Arg Met Pro cys Ala 
465 ' 470 475 480 

Glu Asp Tyr Leu Ser Val Val Leu Asn Gin Leu Cys val Leu His Glu 
485 490 495 

?8 
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Lys Thr Pro val ser Asp Arg val Thr Lys cys cys Thr Glu ser Leu 
500 505 5X0 

val Asn Arg Arg Pro Cys Phe Ser Ala Leu Glu Val Asp Glu Thr Tyr 
SIS 520 525 

Val Pro Lys Glu Phe Asn Ala Glu Thr Phe Thr Phe His Ala Asp lie 
530 53 S 540 

cys Thr Leu Ser Glu Lys Glu Arq Gin lie Lys Lys Gin Thr Ala Leu 
545 550 555 560 

val Glu Leu val Lys His Lys Pro Lys Ala Thr Lys Glu Gin Leu Lys 
565 570 " 575 

Ala Val Mat Asp Asp Phe Ala Ala Phe val Glu Lvs Cys Cys Lys Ala 
580 585 - 590 

Asp Asp Lys Glu Thr cys Phe Ala Glu Glu Gly Lys Lys Leu val Ala 
595 600 605 

Ala Ser Gin Ala Ala Leu Gly Leu 
610 615 

<H0> 14 

<213> 631 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 14 

His val Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly 
1 5 10 15 

Gin Ala Ala Lys clu Phe lie Ala Trp Leu val Lys Gly Arg Gly Gly 

Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly ser Asp Ala 
35 40 45 

His Lys Ser Glu val Ala His Arg phe Lys Asp Leu Gly Glu Glu Asn 
50 55 60 

Phe Lys Ala Leu val Leu lie Ala Phe Ala Gin Tyr Leu Gin Gin cys 
65 70 ?5 30 
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Pro Phe Glu Asp His val Lys Leu Val Asn 61 u val Thr Glu Phe Ala 
85 90 95 

Lys Thr Cvs val Ala Asp Glu Ser Ala Glu Asn Cys Asp Lys Ser Leu 
100 105 110 

His Thr Leu Phe Gly Asp Lys Leu cys Thr val Ala Thr Leu Arg Glu 
115 " 120 125 

Thr Tyr Gly Glu Met Ala Asp Cys Cys Ala Lys Gin Glu Pro Glu Arg 
130 135 140 

Asn Glu Cys Phe Leu Gin His Lys Asp Asp Asn Pro Asn Leu Pro Arg 
145 150 155 160 

Leu val Arg Pro Glu Val Asp Val Met Cys Thr Ala phe His Asn Asn 
165 170 175 

Glu Glu Thr Phe teu Lys Lvs Tyr Leu Tyr Glu lie Ala Arg Arg His 
ISO ' 185 190 

Pro Tyr Phe Tyr Ala Pro Glu Leu Leu Phe Phe Ala Lys Arg Tyr Lys 
195 200 205 

Ala Ala Phe Thr Glu Cys cys Gin Ala Ala Asp Lvs Ala Ala Cys Leu 
210 215 220 

Leu Pro Lys Leu Asp Glu Leu Arg Asp Glu Gly Lys Ala ser ser Ala 
225 230 " 235 240 

Lys Gin Arg Leu Lys Cys Ala Ser Leu Gin Lys Phe Gly Glu Arg Ala 
245 250 255 

Phe Lys Ala Trp Ala val Ala Arg Leu Ser Gin Arq Phe Pro Lys Ala 

260 265 ~ 270 ' 

Glu Phe Ala Glu val Ser Lys teu Val Thr Asp Leu Thr Lys Val His 
275 280 285 

Thr Glu Cys Cys His Gly Asp Leu Leu Glu Cys Ala Asp Asp Arg Ala 
290 295 300 

Asp Leu Ala Lvs Tyr lie cys Glu Asn Gin Asp Ser Tie Ser Ser Lys 
305 310 315 320 

Leu Lys Glu cys cys Glu Lys Pro Leu Leu Glu Lys Ser His cys lie 
325 330 335 

Ala Glu val Glu Asn Asp Glu Met Pro Ala Asp Leu Pro ser Leu Ala 
340 345 350 

30 
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Ala Asp Phe Val Glu Ser tys Asp val Cys Lys Asrt Tyr Ala Glu Ala 
355 360 365 



Lys Asp Val Phe Leu Gly Met Phe Leu Tyr Glu Tyr Ala Arg Arc* His 
370 375 3S0 



Pro Asp jyr ser val Val Leu Leu Leu Arg leu Ala tys Thr Tyr Glu 
385 390 395 400 



Thr Thr Leu Glu Lys cys Cys Ala Ala Ala Asp Pro His Glu Cys Tyr 
405 410 415 



Ala Lys Val Phe Asp Glu Phe Lvs Pro Leu val Glu Glu Pro Gin Asn 
420 425 430 



Leu He Lys Gin Asn Cys Glu Leu Phe Glu Gin Leu Gly Glu Tyr Lys 
435 440 445 



Phe Gin Asn Ala Leu Leu val Arg Tyr Thr Lys Lys Val Pro Gin Val 
4SQ 455 460 



Ser Thr Pro Thr Leu val Glu val ser Arg Asn Leu Gly Lys Val Gly 
465 470 475 480 



Ser Lys Cys Cys Lys His Pro Glu Ala Lys Arg Met Pro Cys Ala Glu 
485 490 495 



Asp Tyr Leu ser val val Leu Asn Gin Leu Cys Val Leu His Glu Lys 
500 505 510 



Thr pro Val Ser Asp Arg val Thr Lys Cys Cys Thr Glu ser Leu val 
515 520 ' 52 S 



Asn Arg Arg Pro Cys Phe ser Ala Leu Glu val Asp Glu Thr Tyr val 
530 535 540 



Pro Lys Glu phe ash Ala Glu Thr Phe Thr phe His Ala Asp lie Cys 
545 550 555 560 



Thr Leu Ser Glu Lys Glu Arg Gin lie Lys Lys Gin Thr Ala Leu val 
565 570 575 



Glu Leu val Lys His Lvs Pro Lys Ala Thr Lys Glu Gin Leu Lvs Ala 
580 585 590 



val wet Asp Asp Phe Ala Ala Phe val Glu Lys cys cvs Lys Ala Asp 
595 600 605 



Asp Lys Glu Thr cys Phe Ala Glu Glu Gly Lys Lys Leu Val Ala Ala 



610 



615 
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Ser Gin Ala Ala Leu Gly Leu 
625 630 



<210> 15 

<211> 640 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> synthetic construct 

<400> 15 

His Gly Glu Gly Thr phe Thr ser Asp Val Ser Ser Tyr Leu G"iu Glu 
1 S 10 15 

Gin Ala Ala Lys Glu Phe lie Ala Trp Leu val Lys Gly Arg Gly ser 
20 25 30 

Ser Gly Ala Pro Pro Pro Ser Gly. Gly Gly Gly Gly ser Gly Gly Gly 
35 40 45 

Gly Ser Gly Gly Gly Gly ser Asp Ala His Lys Ser Glu val Ala His 
50 55 60 

Arg Phe Lys Asp Leu Gly Glu Glu Asn Phe Lys Ala Leu val Leu He 
65 70 75 80 

Ala Phe Ala Gin Tyr Leu Gin Gin Cys Pro Phe Glu Asp His val Lys 
85 90 95 

Leu Val Asn Glu Val Thr Glu Phe Ala Lys Thr cys val Ala Asp Glu 
100 105 110 

Ser Ala Glu Asn Cys Asp Lys Ser Leu His Thr Leu Phe Gly Asp Lys 
115 120 125 

Leu cys Thr val Ala Thr Leu Arg Glu Thr Tyr Gly Glu met Ala Asp 
130 135 140 

Cys Cys Ala Lys Gin Glu Pro Glu Arg Asn Glu Cys Phe Leu Gin His 
145 ' 150 155 160 

Lys Asp Asp Asn Pro Asn Leu pro Arg Leu val Arg pro Glu val asp 
165 170 175 

Val Met cys Thr Ala Phe His Asp Asn Glu Glu Thr phe Leu Lys Lys 
180 185 190 
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Tyr Leu Tyr Glu lie Ala Arg Arg His Pro Tyr Phe Tyr Ala Pro Glu 
195 200 205 

Leu Leu Phe Phe Ala Lys Arg Tyr Lys Ala Ala Phe Thr Glu cys cys 
210 215 220 

Gin Ala Ala Asp Lys Ala Ala cys Leu Leu Pro Lys Leu Asp Glu Leu 
225 230 235 240 

Arg Asp Glu Gly Lys Ala Ser Ser Ala Lys Gin Arg Leu Lys Cys Ala 
245 250 255 

ser Leu Gin Lys Phe Gly Glu Arg Ala Phe Lys Ala Trp Ala val Ala 
260 265 270 

Arg Leu ser Gin Arg phe Pro Lys Ala Glu Phe Ala Glu val ser Lys 
275 280 285 

Leu val Thr Asp Leu Thr Lys val His Thr Glu cys Cys His Gly Asp 
290 295 300 

Leu Leu Glu Cys Ala Asp Asp Arg Ala Asp Leu Ala Lys Tyr lie Cys 
305 310 315 320 

Glu Asn Gin Asp Ser lie ser ser Lys Leu lvs Glu cys cys Glu Lys 
325 ' 330 ' 335 

pro Leu Leu Glu Lys ser His cys lie Ala Glu val Glu Asn Aso Glu 
340 345 350 

Met Pro Ala Asp Leu Pro Ser Leu Ala Ala Asp Phe Val Glu Ser Lys 
355 360 365 

Asp Val Cys Lys Asn Tyr Ala Glu Ala Lys Asp val Phe Leu Gly Met 
370 ■ 375 3S0 

phe Leu Tyr Glu Tyr Ala Arg Arg His Pro Asp Tyr ser val val leu 
385 390 395 400 

Leu Leu Arg Leu Ala Lys Thr Tyr Glu Thr Thr Leu Glu Lvs cys Cys 
405 410 * 415 

Ala Ala Ala Asp Pro His Glu cys Tyr Ala Lys Val Phe Asp Glu Phe 
420 425 430 

Lys Pro teu val Glu Glu Pro Gin Asn Leu lie Lys Gin Asn Cys Glu 
435 440 445 

Leu Phe Glu Gin Leu Gly Glu Tyr Lys Phe Gin Asn Ala Leu Leu Val 
450 455 460 



WO 02/46227 PCT/US01/43165 
X-13991.ST25.txt 
Arg Tyr rhr Lys Lys Val pro Gin val ser Thr Pro Thr Leu Val Glu 



Val Ser Arg Asn leu Gly Lys val Gly ser Lys Cys Cys Lys His Pro 
485 490 495 

61 u Ala Lys Arg Mat Pro cys Ala Glu Asp Tyr Leu ser val val Ley 
500 505 510 

Asn Gin Leu cys val Leu His Glu Lys Thr Pro val ser Asp Arg val 
515 520 525 

Thr Lys Cys Cys Thr Glu Ser Leu Val Asn Arg Arg pro Cys Phe Ser 
530 535 540 

Ala Leu Glu val Asp Glu Thr Tyr val Pro Lys Glu Phe Asn Ala Glu 
545 550 555 560 

Thr Phe Thr phe His Ala Asp He cys Thr Leu ser Glu Lys Glu Arg 
565 570 ' 575 

Gin lie Lys Lys Gin Thr Ala Leu val Glu Leu val cys His Lys Pro 
580 5S5 590 ' 

Lys Ala Thr Lys Glu Gin Leu lys A'ia Val net Asp Asp Phe Ala Ala 
600 605 



595 60< 

Phe val Glu Lys cys cys Lys Ala Asp Asp Lys Glu Thr cys Phe Ala 
610 615 620 

Glu Glu Gly Lys l.ys Leu val Ala Ala Ser Gin Ala Ala Leu Gly Leu 
62 S 630 635 640 

<210> 16 ■ 

<211> 624 

<212> PRT 

<213> Artificial Sequence 



<220> 

<22i> synthetic construct 
<4Q0> 16 

His Gly Glu Gly Thr Phe Thr ser Asp Leu ser Lys Gin Met Glu Glu 
1 5 10 15 

Glu Ala Val Arg Leu Phe lie Glu Trp Leu Lys Asn Gly Gly pro Ser 
20 25 30 
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Ser Gly Ala Pro Pro Pro Ser Asp Ala His Lys Ser Glu Val Ala His 
35 40 45 

Arg Phe Lys Asp Leu Gly Glu Glu Asn Phe Lys Ala Leu Val Leu He 
50 55 60 

Ala Phe Ala Gin Tyr Leu Gin Gin cys Pro Phe Glu Asp His Val Lys 
65 70 75 80 

Leu val Asn Glu val Thr Glu Phe Ala Lys Thr cys val Ala Asp Glu 
85 90 95 

Ser Ala Glu Asn Cys Asp Lys Ser Leu His Thr Leu Phe Gly Asp Lys 
100 105 110 

Leu Cys Thr val Ala Thr Leu Arg Glu Thr Tyr Gly Glu Met Ala Asp 
115 120 125 

Cys Cys Ala Lys Gin Glu Pro Glu Arg Asn Glu cys Phe Leu Gin His 
130 135 " 140 



val Met cys Thr Ala Phe His Asp Asn Glu Glu Thr phe Leu Lys Lys 

165 170 175 

Tyr Leu Tyr Glu lie Ala Arg Arg His Pro Tyr Phe Tyr Ala Pro Glu 

ISO 185 " 190 

Leu Leo Phe Phe Ala Lys Arg Tyr Lys Ala Ala Phe Thr Glu cys cys 



Gin Ala Ala Asp Lys Ala Ala cys Leu Leu pro Lys Leu Asp Glu Leu 
210 215 220 

Arg Asp Glu Gly Lys Ala Ser Ser Ala Lys Gin Arg Leu Lys Cys Ala 
225 230 235 240 

Ser teu Gin Lys Phe Gly Glu Arg Ala Phe Lys Ala Trp Ala Val Ala 
245 250 255 

Arg Leu Ser Gin Arg Piie Pro Lys Ala Glu Phe Ala Glu Val Ser Lys 
260 265 270 

Leu val Thr Asp Leu Thr Lys val His Thr Glu cys Cys His Gly Asp 
275 280 ~lh 

Leu Leu Glu Cys Ala Asp Asp Arg Ala Asp Leu Ala Lys Tyr lie cys 
290 295 300 
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Glu Asn Gin Asp ser lie Ser Ser Lys Leu Lys Glu Cys Cys Glu Lys 
305 310 315 320 



Pro Leu Leu Glu Lys Ser His Cys He Ala Glu Val Glu Asn Asp Glu 
325 330 335 



mt Pro Ala Asp Leu Pro Ser Leu Ala Ala Asp Phe val Glu Ser Lys 
340 345 350 



Asp Val Cys Lys Asn Tyr Ala Glu Ala Lys Asp Val Phe Leu Gly Met 
355 360 365 



Phe Leu Tyr Glu Tyr Ala Arg Arg His Pro Asp Tyr Ser Val Val Leu 
370 ' 375 380 



Leu Leu Arq Leu Ala Lys Thr Tyr G'iu Thr Thr Leu Glu Lys Cys Cys 
385 " 390 395 400 



Ala Ala Ala Asp Pro His Glu cys Tyr Ala Lys val Phe Asp Glu Phe 
405 410 415 



Lys Pro Leu Val Glu Glu Pro Gin Asn Leu lie Lys Gin Asn Cys Glu 
420 425 430 



Leu Phe Glu Gin Leu Gly Glu Tyr Lys Phe Gin Asn Ala Leu Leu val 
435 440 445 



Arg Tyr Thr Lys Lys val Pro Gin val ser Thr Pro Thr Leu val Glu 
450 455 460 



val Ser Arg Asn Leu Gly Lys val Gly Ser Lys Cys Cys Lys His Pro 
465 470 ' 475 480 



Glu Ala Lys Arq Met Pro Cvs Ala Glu Asp Tyr Leu Ser Val Val Leu 
485 490 495 



Asn Gin Leu Cys val Leu His Glu Lys Thr Pro val ser Asp Arg val 
500 505 510 



Thr Lys Cys Cys Thr Glu Ser Leu Val Asn Arg Am Pro cys Phe Ser 
515 S20 525 



Ala Leu Glu val Asp Glu Thr Tyr val Pro Lys Glu Phe Asn Ala Glu 
530 • 535 540 



Thr Phe Thr phe His Ala Asp lie Cys Thr Leu ser Glu Lys Glu Arg 
545 550 555 560 



Gin Tie Lys Lys Gin Thr Ala Leu Val Glu Leu val Lys His Lys Pro 



570 
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Lys Ala Thr Lys Glu Gin Leu Lys Ala val Met Asp Asp Phe Ala Ala 
580 585 590 

Phe Val Glu Lys Cys Cys Lys Ala Asp Asp Lys 61 u Thr Cys Phe Ala 
595 600 605 

Glu Glu Gly Lys Lys Leu Val Ala Ala Ser Gin Ala Ala Leu Gly Leu 
610 615 620 

<210> 17 

<211> 640 

<212> PRT 

<213> Artificial sequence 



<220> 

<223> synthetic construct 
<400> 17 

His Gly Glu Gly Thr Phe Thr Ser Asp Leu ser Lys Gin Met Glu Glu 
1 5 10 15 

Glu Ala Val Arg Leu Phe lie Glu Trp Leu Lys Asn Gly Gly Pro Ser 
20 25 ■ 30 

Ser Gly Ala Pro Pro Pro ser Gly Gly Gly Gly Gly Ser Gly Gly Gly 
35 40 45 

Gly Ser Gly Gly Gly Gly Ser Asp Ala His Lys Ser Glu Val Ala His 
50 55 60 

Arg Phe Lys Asp Leu Gly Glu Glu Asn Phe Lys Ala Leu Val Leu He 



Ala phe Ala Gin Tyr Leu Gin Gin Cys Pro Phe Glu Asp His val Lys 
85 90 95 

Leu val Asn Glu Val Thr Glu phe Ala Lys Thr cys Val Ala Asp Glu 
100 105 110 

Ser Ala Glu Asn Cvs Asp Lys ser Leu His Thr Leu Phe Gly Asp Lys 
115 " 120 125 

Leu cys Thr val Ala Thr Leu Acq Glu Thr Tyr Gly Glu mt Ala Asp 

130 135 " 140 

cys Cys Ala Lys Gin Glu Pro Glu Arg Asn Glu Cys Phe Leu Gin His 
145 ' 150 155 160 
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Lys Asp Asp Asn Pro Asn Leu Pro Arg Leu val Arg Pro Glu Val Asp 
165 170 175 



val Met cvs Thr Ala Phe His Asp Asn Glu Glu Thr Phe Leu Lys Lys 
180 185 190 



Tyr Leu Tyr Glu xle Ala Arg Arg His Pro Tyr Phe T^r Ala Pro Glu 



Leu Leu Phe Phe Ala Lys Arg Tyr Lys Ala Ala Phe Thr Glu Cys Cys 
210 215 220 



Gin Ala Ala Asp Lys Ala Ala Cys Leu Leu Pro Lys Leu Asp Glu Leu 
225 230 235 240 



Arq Asp Glu Gly Lys Ala ser ser Ala Lys Gin Arg Leu Lys Cys Ala 
245 250 ~ 255 



Ser Leu Gin Lys Phe Gly Glu Arg Ala Phe lvs Ala Trp Ala Val Ala 
260 265 270 



Arg Leu Ser Gin Arg Phe Pro Lys Ala Glu Phe Ala Glu val Ser lvs 
275 280 285 



Leu val Thr Asp Leu Thr Lys val His Thr Glu cys Cys His Gly Asp 
290 295 300 



Leu Leu Glu Cys Ala Asp Asp Arg Ala Asp Leu Ala Lys Tyr 2le Cys 
305 310 315 320 



Glu Asn Gin Asp ser lie Ser ser Lys Leu Lys Glu Cys Cys Glu Lys 
325 330 335 



pro Leu teu Glu Lys ser His cys Lie Ala Glu val Glu Asn Asp Glu 
340 345 350 



Met Pro Ala Asp Leu Pro Ser Leu Ala Ala Asp Phe Val Glu Ser Lys 
355 360 365 



Asp val Cys Lys Asn Tyr Ala Glu Ala Lys Asp Val Phe Leu Gly Met 
370 37S 380 



Phe Leu Tyr Glu Tyr Ala Arg Arg His -Pro Asp Tyr Ser val val Leu 
385 390 ' 395 400 



Leu Leu Arg Leu Ala Lys Thr Tyr Glu Thr Thr Leu Glu Lys Cys cys 
405 410 425 



Ala Ala Ala Asp Pro His Glu cys Tyr Ala Lys Val Phe Asp Glu Phe 



420 



4 25 
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Lys Pro Leu va't Glu Glu Pro Gin Asn Leu He Lys Gin Asn cys Glu 
435 440 445 

Leu Phe Glu Gin Leu G'ly Glu Tyr Lys Phe Gin Asn Ala Leu Leu Val 
450 ' 455 ' " 460 

Arg Tyr Thr Lys Lys val Pro Gin val Ser Thr Pro Thr Leu val Glu 
465 470 475 480 

Val Ser Arg Asn Leu Gly Lys Val Gly Ser Lys Cys Cys Lys His Pro 
485 490 495 

Glu Ala Lys Arg Met pro Cys Ala Glu Asp Tyr Leu ser Val Val Leu 
500 505 510 

Asn Gin Leu cys val Leu His Glu Lys Thr Pro val Ser Asp Are val 
515 520 525 

Thr tys cys Cys Thr Glu ser Leu val Asn Arg Arg pro cys Phe Ser 
530 535 540 

Ala Leu Glu Val Asp Glu Thr Tyr va'l Pro lys Glu Phe Asn Ala Glu 
545 550 555 560 

Thr Phe Thr Phe His Ala Asp lie cys Thr Leu ser Glu Lys Glu Arg 
565 570 ' 575 

Gin lie Lys Lys Gin Thr Ala Leu val Glu Leu Val Lv$ His Lys Pro 
5S0 585 590 

Lys Ala Thr Lys Glu Gin Leu Lys Ala Val Met Asp Asp Phe Ala Ala 
595 600 605 

Phe val Glu Lys cys cys Lys Ala Asp Asp Lys Glu Thr Cys Phe Ala 
610 615 620 

Glu Glu Gly Lys Lys Leu val Ala Ala Ser Gin Ala Ala Leu Gly Leu 
625 630 635 640 

<210> 18 

<211> 264 

<212> PRT 

<213> Artificial sequence 



<220> 

<223> synthetic construct 
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<400> 18 

His Val Glu G'fy Thr Phe Thr ser Asp val ser ser Tyr Leu Glu Gly 
1 5 10 15 

Gin Ala Ala Lys Glu Phe lie Ala Trp Leu val Lys Gly Arg Gly Ala 
20 7.5 ' 30 

Glu Pro Lys ser cys Asp Lys Thr His Thr cys pro Pro Cys Pro Ala 
35 40 45 

Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro 
50 55 60 

Lys Asp Thr Leu Met He Ser Arg Thr Pro Glu Val Thr cys val val 
65 70 75 80 

val Asp val ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val 
85 90 95 

Asp Gly Val Glu val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin 
100 105 110 

Tyr Asn Ser Thr Tyr Arg Val Val Ser Va'f Leu Thr Val Leu His Gin 
115 120 125 



Leu Pro Ala Pro lie Glu Lys Thr lie Ser Lys Ala Lys Gly Gin pro 



Arg Glu Pro Gin val Tyr Thr Leu Pro Pro Ser Arg Glu Glu Met Thr 
165 170 175 

tys Asn Gin yal ser Leu Thr cys Leu val Lys Gly Phe Tyr Pro ser 



180 185 

Asp lie Ala Val Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn Asn Tyr 
195 200 205 

Lys Thr Thr Pro pro val Leu Asp ser Asp Gly ser Phe Phe Leu Tyr 
210 215 " 220 

Ser Lys Leu Thr val Asp Lys Ser Arg Trp Gin Gin Gly Asn val Phe 
225 230 235 240 

ser cys Ser val Met His Glu Ala Leu His Asn His Tyr Thr Gin Lys 
245 250 255 

Ser teu Ser Leu ser Pro Gly Lys 
260 
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<210> 19 

<213> 272 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 19 

His val Glu Gly Thr Phe Thr Ser Asp val ser Ser Tyr Leu Glu Gly 
15 10 15 



Gin Ala Ala Lys Glu Phe lie Ala Trp Leu val Lys Gly Arg Gly Ser 
20 25 30 



ser Gly Ala Pro Pro Pro ser Ala Glu Pro Lys ser Cys Asp Lys Thr 
35 40 45 



His Thr Cys Pro Pro cys Pro Ala Pro Glu Leu Leu Gly Gly Pro ser 
50 55 60 



Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met lie Ser Arc 
65 70 75 80 



Thr Pro Glu val Thr cys val val val Asp val ser His Glu Asp Pro 
85 90 95 



Glu Val Lys Phe Asn Trp Tyr Val Asp Gly val Glu Val His Asn Ala 
100 105 110 



Lys Thr Lys Pro Arg Glu Glu Gin Tyr Asn Ser Thr Tyr Arg val Val 
115 . 120 125 



ser val Leu Thr Val Leu His Gin Asp Trp Leu Asn Gly cys Glu Tyr 
130 135 140 



Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro lie Glu Lvs Thr 
145 ISO 155 160 



lie Ser Lys Ala Lys Gly Gin Pro Arg Glu Pro Gin val Tyr Thr Leu 
165 170 175 



Pro Pro ser Arg Glu Glu Met Thr Lys Asn Gin val ser Leu Thr Cys 
180 185 190 



Leu val Lys Gly Phe Tyr Pro Ser Asp lie Ala Val Glu Trp Glu Ser 




200 



205 
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Asn Gly Gin Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro val Leu Asp 
210 215 220 

Ser Asp Gly Ser Phe Phe Leu Tyr ser Lys Leu Thr val Asp Lys Ser 
225 230 235 240 

Arg Trp Gin Gin Gly Asn Val Phe Ser Cys ser val Met His Glu Ala 
243 250 255 

Leu His Asrs His Tyr Thr Gin Lys ser Leu Ser Leu Ser Pro Gly Lys 
260 265 270 

<210> 20 
<211> 264 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 20 

His val Glu Gly Thr Phe Thr Ser Asp val Ser Ser Tyr Leu Glu Glu 
15 10 15 

Gin Ala Ala Lys Glu Phe lie Ala Trp Leu val Lys Gly Arg Gly Ala 
20 25 " 30 

Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala 
35 40 45 

Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro 
50 ' 55 60 

Lys Asp Thr Leu Met lie Ser Arg Thr Pro Glu Val Thr Cys Val Val 
65 70 75 80 

Val Asp val ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr val 
85 90 95 

Asp Gly val Glu val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin 
100 105 110 

Tyr Asn Ser Thr Tyr Arg Val val Ser Val Leu Thr Val Leu His Gin 
115 120 125 

Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala 
130 135 140 

42 
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Leu Pro Ala Pro lie Glu Lys Thr lie Ser Lys Ala Lys Gly Gin pro 
145 150 155 160 

Arg Glu Pro Gin val Tyr Thr Leu Pro Pro ser Arg Glu Glu Met Thr 
165 170 175 

Lys Asn Sin Val Ser Leu Thr cys Leu val Lys Gly Phe Tyr pro ser 
180 185 190 

Asp lie Ala Val Glu Trp Glu Ser Asn Gly Gin Pro Glu ash Asn Tyr 
195 200 205 

Lys Thr Thr Pro Pro val Leu Asp Ser Asp Gly Ser Phe phe Leu Tyr 
210 215 220 

ser Lys Leu Thr val Asp Lys ser Arg Trp Gin Gin Gly Asn val Phe 
225 230 235 240 

ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gin Lys 
245 250 255 

ser Leu ser Leu Ser Pro Gly Lys 
260 

<210> 21 
<211> 272 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 21 

His val Glu Gly Thr Phe Thr Ser Asp Val ser Ser Tyr Leu Glu Glu 
1 5 10 15 

Gin Ala Ala Lys Glu Phe lie Ala Trp Leu Val Lys Gly Arg Gly Ser 
20 25 30 

ser Gly Ala Pro Pro Pro Ser Ala Glu Pro Lys Ser cys Asp Lys Thr 
35 40 45 

His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro ser 
50 55 60 

Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met lie ser Arg 
65 70 75 ■ 80 

43 
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Thr pro Glu Val Thr Cys Vai val Val Asp val Ser His Glu Aso Pro 
85 90 95 

Glu val Lys Phe Asn Trp Tyr val Asp Giy val Glu val His Asn Ala 
100 105 110 

Lys Thr Lys Pro Arg Glu Glu Gin Tyr Asn Ser Thr Tyr Arg val Val 
115 120 125 

Ser val Leu Thr val Leu His Gin Asp Trp Leu Asn Giy Lys Glu Tyr 
130 135 140 

Lys cys Lys val Ser Asn Lys Ala Leu Pro Ala Pro lie Glu Lys Thr 
145 150 155 160 

lie ser Lys Ala Lys Giy Gin Pro Arg Glu Pro Gin val Tyr Thr Leu 
165 170 1.75 

Pro pro ser Arg Glu Glu Met Thr Lys Asn Gin Val Ser Leu Thr cys 
180 185 190 

Leu Val Lys Giy Phe Tyr Pro Ser Asp He Ala val Glu Trp Glu Ser 
195 200 205 

Asn Giy Gin Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp 
210 215 220 

Ser Asp Giy Ser phe Phe Leu Tyr ser Lys Leu Thr Val Asp Lys ser 
225 230 235 240 

Arg Trp Gin Gin Giy Asn Val Phe Ser Cys Ser Val Met His Glu Ala 
245 250 255 

Leu His Asn His Tyr Thr Gin Lys ser Leu Ser Leu Ser Pro Giy Lys 
260 265 270 

<210> 22 
<211> 272 

<212> pirr 

<213> Artificial sequence 



<220> 

<223> synthetic, construct 
<400> 22 

His Giy Glu Giy Thr Phe Thr Ser Asp Val ser ser Tyr Leu Glu Glu 
15 10 15 

44 . 
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Gin Ala Ala Lys Glu Phe He Ala Trp Leu Val Lys Gly Arg Gly Ser 
20 2S 30 

Ser Gly Ala Ser ser Gly Ala Ala Glu Pro Lys Ser cys Asp Lvs Thr 
35 40 45 

His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro ser 
50 5S 60 

Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met lie Ser Arg 
65 70 75 80 

Thr Pro Glu Val Thr Cys Val Val val Asp val ser His Glu Asp Pro 
85 90 95 

Glu val Lys Phe Asn Trp Tyr val Asp Gly Val Glu Val His Asn Ala 
100 105 110 

Lys Thr Lys Pro Arg Glu Glu Gin Tyr Asn Ser Thr Tyr Arg val val 
115 120 125 

Ser Val Leu Thr val Leu His Gin Asp Trp Leu Asn Gly Lys Glu Tyr 
130 135 140 

Lys cys Lys val ser Asn Lys Ala Leu Pro Ala Pro He Glu Lys Thr 
145 150 155 160 

lie Ser Lys Ala Lys Gly Gin Pro Ara Glu Pro Gin val Tyr Thr Leu 
165 " 170 175 

Pro Pro Ser Arg Glu Glu Met Thr Lys Asn Gin Val Ser Leu Thr cys 
180 185 190 

Leu val Lys Gly Phe Tyr Pro ser Asp He Ala val Glu Trp Glu Ser 

195 ' 200 205 

Asn Gly Gin Pro Glu Asn Asn Tyr Lys Thr Thr pro Pro val Leu Asp 
210 215 220 

Ser Asp Gly Ser Phe Phe Leu Tyr ser Lys Leu Thr Val Asp Lys Ser 
225 230 235 240 

Arq Trp Gin Gin Gly Asn Val Phe ser cys Ser val Met His Glu Ala 
245 250 255 

Leu His Asn His Tyr Thr Gin Lys Ser Leu Ser Leu Ser Pro Gly Lys 
260 265 270 
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<211> 287 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 23 

His 61 y Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu 61 u 
15 10 15 

Gin Ala Ala Lys Glu Phe lie Ala Trp Leu val Lys Gly Arg Gly ser 



Ser 6ly Ala Pro Pro Pro ser Gly Gly Gly Gly ser Gly Gly Gly Gly 

35 40' " 45 

Ser Gly Gly Gly Gly ser Ala Glu Pro Lys Ser cys Asp Lys Thr His 
50 55 60 

Thr Cys pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val 
65 70 75 80 

Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met He Ser Arg Thr 
85 90 35 

pro Glu Val Thr cys val val val Asp val ser His Glu Asp Pro Glu 
100 " 105 110 

val Lys Phe Asn Trp Tyr val Asp Gly Val Glu val His Asn Ala Lys 
US 120 125 

Thr Lys Pro Arg Glu Glu Gin Tyr Asn Ser Thr Tyr Arg Val Val Ser 
130 135 ' 140 

Val Leu Thr Val Leu His Gin Asp Trp Leu Asn Gly Lys Glu ryr Lys 
145 ISO 155 150 

Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro lie Glu Lys Thr lie 
165 170 175 

Ser Lys Ala Lys Glv Gin Pro Arg Glu Pro Gin Val Tyr Thr Leu Pro 
180 IS 5 * 190 

pro ser Arg Glu Glu Met Thr Lys Asn Gin Val Ser Leu Thr Cys Leu 
195 200 205 

Val Lys Gly Phe Tyr Pro Ser Asp .Tie Ala Val Glu Trp Glu Ser Asn 
210 215 220 

46 
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Gly Gin Pro Glu Asn Asn Tyr Lys Thr Thr pro pro val Leu Asp ser 
225 230 235 240 

Asp Gly ser Phe Phe Leu Tyr ser lys Leu Thr val Asp Lys Ser Arg 
245 250 ' 255 

Trp Gin Gin Gly Asn val Phe ser cys Ser val Met His Glu Ala Leu 
260 265 270 

His Asn His Tyr Thr Gin Lys Ser Leu Ser Leu Ser Pro Gly Lys 
275 280 285 

<210> 24 

<21t> 284 

<212> PRT 

<213> Artificial sequence 



<220> 

<223> synthetic construct 
<400> 24 

His Gly Glu Gly Thr Phe Thr Ser Asp Val Ser ser Tyr Leu Glu Glu 
1 5 10 15 

Gin Ala Ala Lys Glu Phe lie Ala Trp Leu Val Lvs Gly Arg Gly ser 
20 2S 30 

Ser Gly Ala Pro Pro Pro Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly 
35 40 45 

Ser Gly Gly Gly Gly Ser Ala Glu Ser Lys Tyr Gly Pro Pro Cys Pro 
50 55 60' 

Ser Cys Pro Ala Pro Glu Phe Leu Gly Gly Pro ser Val Phe teu Phe 



Pro Pro Lys Pro Lys Asp Thr Leu Met lie ser Arg Thr Pro Glu Val 
85 90 95 

Thr cys val val val Asp val ser Gin Glu Asp Pro Glu val Gin Phe 
100 105 110 

Asn Trp Tyr val Asp Gly val Glu val His Asn Ala Lys Thr Lys pro 
115 120 125 

Arg Glu Glu Gin Phe Asn ser Thr Tyr Arg val val Ser val Leu Thr 
130 135 140 
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val Leu His Gin Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val 
145 150 155 160 

Ser Asn Lys Gly Leu Pro Ser Ser lie Glu Lys Thr lie Ser Lys Ala 
165 170 175 

Lys Gly Gin Pro Arg Glu Pro Gin val Tyr Thr Leu Pro Pro ser Gin 
180 185 190 

Glu Glu Met Thr Lys Asn Gin Val ser Leu Thr c.ys Leu val Lys Gly 
195 200 205 

Phe Tyr Pro Ser Asp lie Ala Val Glu Trp Glu ser Asn Glv Gin pro 
210 215 220 

Glu Asn Asn Tyr Lys Thr Thr pro Pro Val Leu Asp Ser Asp Gly Ser 
225 230 235 240 

Phe Phe Leu Tyr Ser Arg Leu Thr Val Asp Lvs Ser Arg Trp Gin Glu 
245 250 ' 255 

Gly Asn val Phe ser cys ser val Met His Glu Ala Leu His Asn His 
260 ' 265 270 

Tyr Thr Gin Lys Ser Leu Ser Leu ser Leu Gly Lys 
275 2S0 

<210> 25 

<2H> 302 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 25 

His Gly Glu Gly Thr Phe Thr ser Asp val ser ser Tyr Leu Glu Glu 
1 5 10 IS 

Gin Ala Ala Lys Glu Phe lie Ala Trp Leu val Lys Gly Arg Gly ser 

Ser Gly Ala Pro Pro Pro ser Gly Gly Gly Gly Ser Gly Gly Gly Glv 
35 40 45 

ser Gly Gly Gly Gly ser Gly Gly Gly Gly Ser Gly Gly Glv Gly Ser 
50 55 60 



WO 02/4*227 PCT/BSM/4WSS 
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Gly Gly Gly Gly ser Ala Glu Pro. Lys Ser Cys Asp Lys Thr His Thr 
65 70 75 80 

Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser val Phe 
85 90 95 

Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met lie Ser Arg Thr Pro 
100 105 110 

Glu val Thr Cys Val Val val Asp Val Ser His Glu Asp Pro Glu Val 
115 120 125 

Lys Phe Asn Trp Tyr val Asp Gly Val Glu Val His Asn Ala Lys Thr 
130 135 140 

Lys Pro Arg Glu Glu Gin Tyr Asn ser Thr Tyr Arg val val ser val 
145 150 155 160 

Leu Thr val Leu His Gin Asp Trp Leu Asn Gly Lys Glu Tyr Lvs Cys 
165 170 175 

Lys val ser Asn Lys Ala Leu Pro Ala Pro lie Glu Lys Thr He ser 
180 185 190 

Lys Ala Lys Gly Gin Pro Arg Glu Pro Gin val Tyr Thr Leu Pro Pro 
195 " 200 205 

Ser Arg Glu Glu Met Thr Lys Asn Glti Val ser Leu Thr cys Leu val 
210 215 220 

Lys Gly Phe Tyr Pro Ser Asp lie Ala val Glu Trp Glu ser Asn Gly 

225 230 235 240 

Gin Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser asp 
245 250 255 

Gly ser Phe Phe Leu Tyr ser Lys Leu Thr val Asp Lys ser Arg Trp 
■ 260 265 270 

Gin Gin Gly Asn val Phe ser Cys ser val Met His Glu Ala Leu His 
275 280 285 

Asn His Tyr Thr Gin Lys ser Leu Ser Leu Ser pro Gly Lys 
290 295 300 

<210> 26 
<211> 294 
<212> PRJ 
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<213> Artificial Sequence 



<22Q> 

<223> synthetic construct 
<400> 26 

His Gly Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Glu 
1 5 10 15 

Gin Ala Ala t.ys Glu Phe He Ala Trp Leu Val Lys Gly Arg Gly Gly 
20 25 30 

Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly ser Gly Gly 



Gly Gly ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Glu pro 
50 55 60 

Lys Ser cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu 



Leu Leu Gly Gly Pro ser val phe Leu phe Pro Pro Lys Pro Lys Asp 
85 90 95 

Thr Leu Met He ser Arg Thr pro Glu val Thr Cys val val val Asp 
100 105 110 

val ser His Glu Asp Pro Glu val Lys Phe Asn Trp Tyr val Asp Gly 
115 120 125 

Val Glu Val His Asn Ala Lys Thr i.ys Pro Arq Glu Glu Gin Tyr Asn 
130 135 * 140 

Ser Thr Tyr Arg val Val Ser val Leu Thr Val Leu His Gin Asp Trp 
145 150 155 150 

Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro 
165 170 175 

Ala pro lie Glu Lys Thr lie ser Lys Ala Lys Gly Gin Pro Arg Glu 
180 185 " 190 

Pro Gin Val Tyr Thr Leu pro Pro ser Arq Glu Glu Met Thr Lys Asn 
195 200 205 

Gin Val Ser Leu Thr Cys Leu val Lys Gly Phe Tyr Pro Ser Asp lie 
210 215 " 220 

Ala val Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn Asn Tyr Lys Thr 
225 230 " 235 240 

50 
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Thr Pro Pro val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr ser Lys 
245 ' 250 255 

Leu Thr val Asp Lys Ser Arg Trp Gin Gin Gly Asn Val Phe Ser cys 
260 265 270 

Ser val Met His Glu Ala Leu His Asn His Tyr Thr Gin Lys ser Leu 
2.75 280 285 

ser Leti ser Pro Gly Lys 
290 

<210> 27 
<213> 280 
<212> PRT 

<213> Artificial sequence 
<22G> 

<223> synthetic construct 
<400> 27 

His Gly Glu Gly Thr Phe Thr ser Asp val Ser Ser Tyr Leu Glu Glu 
1 ' 5 10 15 

Gin Ala Ala Lys Glu Phe lie Ala rrp Leu val Lys Glv Arg Gly Ser 
20 25 30 

ser Gly Ala Pro Pro Pro ser Ser Ser Gly Ala Pro Pro Pro Ser Ala 
35 40 45 

Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala 
50 55 60 

Pro Glu Leu Leu Gly Gly pro Ser val Phe Leu Phe pro pro Lvs pro 
65 70 75 ' SO 

Lys Asp Thr Leu Met lie Ser Arg Thr pro Glu Val Thr Cys val val 
.85 90 95 

val Asp Val ser His Glu Asp Pro Glu val Lys Phe Asn Trp Tyr Val 
100 105 110 

Asp Gly val Glu val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin 
115 120 12§ 

Tvr Asn Ser Thr Tyr Arg Val Val Ser val Leu Thr Val Leu His Gin 
130 135 140 
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Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala 
145 ISO 155 160 

Leu pro Ala Pro lie Glu Lys Thr lie Ser Lys Ala Lys Gly Gin Pro 
165 170 175 

Arg Glu Pro Gin val Tyr Thr Leu Pro Pro ser Arg Glu Glu mt Thr 
180 185 190 

Lys Asn Gin Val Ser Leu Thr Cys Leu val Lys Gly Phe Tyr Pro ser 
195 200 205 

Asp lie Ala Val Glu Trp Glu ser Asn Gly Gin Pro Glu Asn Asn Tyr 
210 215 220 

Lys Thr Thr Pro Pro val Leu Asp ser Asp Gly ser Phe Phe Leu Tyr 
225 230 235 240 

Ser Lys Leu Thr val Asp Lys Ser Arg Trp Gin Gin Gly Asn Val Phe 
245 250 255 

Ser cys ser val Met His Glu Ala Leu His Asn His Tyr Thr Gin Lys 
260 265 270 

Ser Leu ser Leu ser Pro Gly Lys 
275 280 

<210> 28 

«211> 287 

<212> PRT 

<213> Artificial sequence 



<22Q> 

<223> synthetic construct 
<400> 28 

His Gly Glu Gly Thr Phe Thr ser Asp val ser ser Tyr Leu Glu Glu 
1 5 10 15 

Gin Ala val Lys Glu Phe lie Ala Trp Leu He Lys Gly Arg Gly ser 
. 20 25 30 

Ser Gly Ala Pro Pro Pro ser Gly Gly Gly Glv ser Gly Gly Gly Gly 
35 40 45 

Ser Gly Gly Gly Gly Ser Ala Glu pro Lys ser cys Asp Lys Thr His 
50 55 60 
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Thr Cys Pro Pro cys Pro Ala Pro Glu Leu Leu Gly Gly pro Ser val 
65 70 75 80 

Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met lie Ser Arg Thr 
85 90 95 

pro Glu val Thr cys Val val val Asp val Ser His Glu Asp Pro Glu 
100 105 110 

Val Lys Phe Asn Trp Tyr val Asp Gly val Glu Val His Asn Ala Lys 
115 ' 120 125 

Thr Lys Pro Arg G'iu Glu Gin Tvr Asn Ser Thr Tyr Arg val Val ser 
130 " 135 140 

Val Leu Thr val Leu His Gin Asp Trp Leu Asn Gly Lys Glu Tyr Lys 
145 150 155 160 

Cys Lys val ser Asn Lys Ala Leu Pro Ala Pro lie Glu Lys Thr lie 
165 170 175 

Ser Lys Ala Lys Gly Gin Pro Arg Glu Pro Gin Val Tyr Thr Leu pro 
ISO 185 190 

Pro Ser Arg Glu Glu Met Thr Lys Asn Gin Val ser Leu Thr Cys Leu 
195 200 205 

val Lys Gly Phe Tyr Pro Ser Asp lie Ala val Glu Trp Glu ser Asn 
210 215 220 

Gly Gin pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro val Leu Asp Ser 
225 230 235 240 

Asp Gly ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys ser Arg 
245 250 255 

Trp Gin Gin Gly Asn Val phe ser cys Ser Val Met His Glu Ala Leu 
260 265 270 

His Asn His Tyr Thr Gin Lys Ser Leu Ser Leu Ser Pro Gly Lys 
275 280 285 

<210> 29 

<211> 272 

<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> synthetic construct 
<400> 29 

His Gly Glu Gly Thr Phe Thr Ser Asp Leu Ser Lys Gin Met Glu Glu 
15 10 IS 



Glu Ala Val Arg Leu Phe He Glu Trp Leu Lys Asn Gly Gly Pro Ser 
20 25 30 



ser Gly Ala Pro Pro Pro Ser Ala Glu Pro Lys Ser Cys asp Lys Thr 
35 40 45 



His Thr cys Pro Pro Cys Pro Ala pro Glu Lett Leu Gly Gly Pro ser 
50 55 60 



val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met lie Ser Arg 
65 70 75 80 



Thr pro Glu Val Thr Cys val val val Asp val ser His Glu Asp Pro 
85 90 95 



Glu val tys Phe Asn Trp Tyr val Asp Gly val Glu val His ash Ala 
100 105 110 



Lys Thr Lys Pro Arg Glu Glu Gin Tyr Asn Ser Thr Tyr Arg Val Val 
115 120 125 



Ser Val Leu Thr Val Leu His Gin Asp Trp Leu Asn Gly Lv$ Glu Tyr 
130 135 140 



Lys cys Lys val ser Asn Lys Ala Leu Pro Ala Pro lie Glu Lys Thr 
145 " 150 155 160 



lie Ser Lys Ala Lys Gly Gin Pro Arg Glu Pro Gin Val Tyr Thr Leu 
165 170 175 



Pro Pro ser Arg Glu Glu Met Thr Lys Asn Gin val Ser Leu Thr Cys 
180 185 ISO 



Leu val lys Gly Phe Tyr Pro Ser Asp lie Ala val Glu Trp Glu ser 
195 200 205 



Asn Gly Gin Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro val Leu Asp 
210 215 220 



Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr val Asp Lys Ser 
225 230 235 240 



Arg rrp Gin Gin Gly Asn Val phe ser cys ser val Met His Glu Ala 

7AK 7Zf\ -JSC 




250 



253 
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Leu His Asn His Tyr Thr Gin Lys Ser Leu ser Leu Ser Pro Gly Lys 
260 265 270 

<210> 30 
<2tl> 272 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 30 

His Gly Glu Gly Thr Phe Thr Ser Asp Leu ser Lys Gin Met Glu Glu 
1 5 10 15 

Glu Ala Val Arg Leu Phe He Glu Trp Leu Lys Asn Gly Gly Pro Ser 
20 25 30 

Ser Gly Ala Ser Ser Gly Ala Ala Glu Pro Lys Ser Cys Asp Lys Thr 
35 40 45 

His Thr Cys pro Pro cys Pro Ala Pro Glu Leu Leu Gly Gly Pro ser 
SO 55 60 

Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met lie Ser Arg 
65 70 75 80 

Thr Pro Glu val Thr cys Val Val Val Asp Val Ser His Glu Asp Pro 
85 90 95 

Glu Val Lys Phe Asn Trp Tyr val Asp Gly Val Glu Val His Asn Ala 
100 105 110 

Lys Thr Lys Pro Arg Glu Glu Gin Tyr Asn Ser Thr Tyr Arg Val Val 
115 120 125 

ser Val Leu Thr val Leu His Gin Asp Trp Leu Asn Gly Lys Glu Tyr 
130 135 140 

Lys cys Lys val Ser Asn Lys Ala Leu Pro Ala Pro lie Glu Lys Thr 
145 ISO 155 160 

lie Ser Lys Ala Lys Gly Gin Pro Arg Glu Pro Gin Val Tyr Thr Leu 
165 170 175 

Pro pro ser Arg Glu Glu Met Thr Lys Asn Gin Val Ser Leu Thr cys 
ISO IS 5 190 

55 
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Leu Val Lys Gly Phe Tyr Pro Ser Asp lie Ala Val Glu Trp Glu Ser 
195 200 205 

Asn Gly Gin Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp 
210 215 220 

ser Asp Gly ser phe Phe Leu Tyr ser Lys Leu Thr val Asp Lys ser 
225 230 235 240 

Arg Trp Gin Gin Gly Asn val Phe Ser Cys ser Val Met His Glu Ala 
245 250 255 

Leu His Asn His Tyr Thr Girt tys Ser Leu Ser Leu Ser Pro Gly Lys 
260 265 270 

<210> 31 
<211> 287 
<212> PRT 

<2I3> Artificial Sequence 



<220> 

<223> synthetic construct 
«400> 31 

His Gly Glu Gly Thr Phe Thr ser Asp Leu Ser Lys Gin Met Glu Glu 
15 10 IS 

Glu Ala val Arg Leu Phe lie Glu Trp Leu Lys Asn Gly Gly Pro Ser 
20 25 30 

ser Gly Ala Pro Pro Pro ser Gly Gly Gly Gly Ser Gly Gly Gly Gly 
35 40 " 45 

Ser Gly Gly Gly Gly ser Ala Glu pro Lys Ser cys Asp Lys Thr His 
50 55 60 

Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val 
65 70 75 80 

Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met He Ser Arg Thr 
S5 90 95 

Pro Glu val Thr cys val val val Asp val ser His Glu Asp Pro Glu 
100 105 110 

Val Lys Phe Asn Trp Tyr val Asp Glv val Glu val His Asn Ala Lys 
115 120 125 

56 
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!5.txt 



Thr Lys Pro Arg Glu Glu Gin Tyr Asn Ser Thr Tyr Arg val val Ser 
130 135 140 



val Leu Thr val Leu His Gin Asp Trp Leu Asn Gly Lys Glu Tyr tys 
145 150 155 160 



Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro lie Glu Lys Thr lie 
165 170 175 



Ser Lys Ala Lys Gly Gin Pro Arg Glu Pro Gin val Tyr Thr Leu Pro 
180 .185 190 



pro ser Arg Glu Glu Met Thr Lys Asn Gin val Ser Leu Thr Cys Leu 
195 200 205 



Val Lys Gly Phe Tyr Pro Ser Asp lie Ala Val Glu Trp Glu Ser Asn 
210 215 220 



Gly Gin Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser 
225 230 235 240 



Asp Gly ser Phe Phe Leu Tyr ser Lys Leu Thr val Asp Lys ser Arg 
245 250 255 



Trp Gin Gin Gly Asn val Phe ser Cys Ser val Met: His Glu Ala Leu 
260 265 270 



Mis Asn His Tyr Thr Gin Lys Ser Leu Ser Leu Ser Pro Gly Lys 
275 280 285 



<210> 32 

<211> 232 

<21Z> PRT 

<213> Homo sapiens 



<400> 32 , 

Ala Glu Pro Lys ser cys Asp Lys Thr Hi s Thr cys Pro Pro cys Pro 
1 5 10 15 



Ala Pro Glu Lys Gly Gly Pro Ser val Phe Leu Phe Pro Pro Lys Pro 
20 25 30 



Lys Asp Thr Lys Met lie Ser Arg Thr Pro Glu Val Thr cys val val 
35 40 45 



val Asp val ser His Glu Asp Pro Glu val Lys Phe Asn Trp Tyr Val 



50 



55 



60 
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Asa Gly val Glu Val His Asn Ala Lys Thr Lys pro Arg Glu Glu Gin 
65 70 75 ' SO 

Tvr Asn Ser Thr Tyr Arg val Val Ser. val Leu Thr val Leu His Gin 
85 90 95 

Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala 

100 105 110 



Arg Glu Pro Gin val Tyr Thr Leu Pro pro ser Arg Glu Glu mt Thr 
130 135 140 

Lys Asn Gin val Ser Leu Thr cys Leu Val Lys Gly Phe Tyr Pro ser 
145 ' 150 155 160 

Asp lie Ala Val Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn Asn Tyr 
165 170 175 

Lys Thr Thr pro Pro val Leu Asp ser Asp Gly ser phe Phe Lea Tyr 
180 185 190 

Ser Lys Leu Thr val Asp Lys ser Arg Trp Gin Gin Gly Asn val Phe 
195 200 205 

Ser cys Ser val Met His Glu Ala Leu His Asn His Tyr Thr Gin Lys 
210 215 220 

Ser Leu Ser Leu Ser Pro Gly Lys 
230 



225 




<210> 


33 


<211> 


703 


<212> 


OHA 


«213> 


Homo 


<400> 


33 



gagcccaaat cttgtgacaa aactcacaca tgcccaccgt gcccagcacc tgaactcctg 60 

gggggaccgt cagtcttcct cttcccccca aaacecaagg acaccctcat gatctcccgg 120 

acccctgagg tcacatgcgt ggtggtggac gtgagccacg aagaccctga ggtcaagttc 180 

aactggtacg tggacggcgt ggaggtgcat aatgccaaga caaagccgcg ggaggagtag 240 

tacaacagca cgtaccgtgt ggtcagcgtc ctcaccgtcc tgcaccagga ctggctgaat 300 

t - ocaa ggtctccaac aaagccctcc cagcccccat cgagaaaacc 360 
58 
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atctccaaag ccaaagggca gececgagaa ccacaggtgt acaccctgcc cccatcccgg 420 

gaggagatga ccaagaacca ggtcagcctg acctgcctgg tcaaaggctt ctatcccagc 480 

gacatcgccg tggagtggga gagcaatggg cagccggaga acaactacaa gaccacgcct S40 

cccgtgctgg actccgacgg ctccttcttc ctctatagca agctcaccgt ggacaagagc 600 

aggtggcagc aggggaacgt cttctcatgc tccgtgatgc atgaggctct gcacaaccac 660 

tacacgcaga agagcctctx cctgtctccg ggtaaatgat agt 703 



<210> 


34 


<211> 


585 


<212> 


PRT 


<213> 


Homo 


<400> 


34 



Asp Ala His Lys Ser Glu Val Ala His Arg Phe Lys Asp Leu Gly 61 u 
1 5 10 15 



Glu Asn Phe Lys Ala Leu val Leu lie Ala Phe Ala Gin Tyr Leu Gin 

20 25 30 

Gin cys Pro Phe Glu Asp His val Lys Leu val Asn Glu val Thr Glu 
35 40 45 



Phe Ala Lys Thr Cys Val Ala Asp Glu 5er Ala Glu Asn cys Asp Lys 
50 55 60 



ser Leu His Thr Leu Phe Gly Asp Lys Leu cys Thr val Ala Thr Leu 



Arg Glu Thr Tyr Gly Glu Met Ala Asp Cys Cys Ala Lys Gin Glu Pro 
85 90 95 

Glu Arg Asn Glu Cys Phe Leu Gin His Lys Asp Asp Asn Pro Asn Leu 
100 105 110 

Pro Arg Leu Val Arg Pro Glu val Asp Val Met cys Thr Ala Phe His 
115 ~ 120 125 

Asp Asn Glu Glu Thr Phe Leu Lys Lys Tyr Leu Tyr Glu lie Ala Arg 
130 135 140 

Arg His Pro Tyr Phe Tyr Ala Pro Glu Leu Leu Phe Phe Ala Lys Arg 



65 



70 



SO 



14* 



155 




Tyr Lys Ala Ala Phe Thr Glu Cys Cys Gin Ala Ala Asp Lys Ala Ala 
165 170 175 

59 
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cys 



Pro 
180 



Lys 




Gly 



190 



Ser Ala Lys Gin Arq Leu Lys cys Ala Ser Leu Gin Lys Phe Gly Glu 
195 " 200 205 



Arg Ala Phe Lys Ala Trp Ala val Ala Arg leu Ser Gin Arg Phe Pro 
210 215 220 



Lys Ma Glu Phe Ala clu val ser Lys Leu val Thr Asp Leu Thr Lys 
225 230 235 240 



val His Thr clu cys Cys His Gly Asp Leu Leu Glu cys Ala Asp Asp 
245 250 25§ 



Arg Ala Asp Leu Ala Lys Tyr lie cys Glu Asn Gin Asp Ser lie Ser 
260 265 270 



Ser Lys Leu Lys Glu cys Cys Glu Lys Pro Leu Leu Glu Lys Ser His 
275 280 285 



Cys lie Ala Glu val Glu Asn asp Glu Met pro Ala Asp Leu Pro Ser 
290 295 300 



Leu Ala Ala Asp Phe val Glu ser Lys Asp val cys Lys Asn Tyr Ala 
305 310 ' 315 320 



Glu Ala Lys Asp val Phe Leu Gly Met Phe teu Tyr Glu Tyr Ala Arg 
325 330 335 " 



Arg His Pro Asp Tyr ser val val Leu Leu Leu Arg Leu Ala Lys Thr 
340 345 350 



Tyr Glu Thr Thr Leu Glu Lys cys Cys Ala Ala Ala Asp Pro His Glu 
355 360 365 



Cys Tyr Ala Lys Val Phe Asp Glu Phe Lys Pro Leu val Glu Glu Pro 
370 375 380 



Gin Asn Leu He Lys Gin Asn cys Glu Leu Phe Glu Asn Leu Gly Glu 
385 390 395 400 



Tyr Lys Phe Gin Asn Ala Leu Leu Val Arg Tyr Thr Lys Lys val pro 
405 410 415 



Gin Val Ser Thr pro Thr Leu val Glu Val ser Arg Asn Leu Gly Lys 
420 425 430 



Val Gly Ser Lys Cys cys Lys His Pro Glu Ala Lys Arg wet Pro cys 



435 



440 




60 
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Ala Glu Asp Tyr Leu set- vai val Leu Asn Gin Leu cys val Leu His 
450 455 460 

Glu Lys Thr Pro val ser Asp Arg val Thr Lys Cys Cys Thr Glu Ser 



Leu val Asn Arg Arg Pro Cys Phe Ser Ala Leu Glu Val Asp Glu Thr 
485 490 495 

Tyr val Pro Lys Glu Phe Asn Ala Glu Thr Phe Thr Phe His Ala Asp 
500 505 510 

lie Cys Thr Leu Ser Glu Lys Glu Arg Gin lie Lys Lys Gin Thr Ala 
515 520 525 

Leu val Glu Leu val Lys His Lys Pro Lys Ala Thr Lys Glu Gin Leu 
530 ' 535 540 

Lys Ala Val Met Asp Asp phe Ala Ala Phe Val Glu Lvs Cys Cys Lys 
545 550 555 560 

Ala Asp Asp Lys Glu Thr Cys Phe Ala Glu Glu G'ly Lys Lys Leu Val 
565 570 ' 575 

Ala Ala ser Gin Ala Ala Leu Gly Leu 
580 585 

<210> 35 

<211> 1762 

<212> DNA 

<213> Homo sapiens 

<400> 35 

gatgcgcaca agagtgaggt tgctcatcgg tttaaagatt tgggagaaga aaatttcaaa 60 

gccttggtgt tgattgcctt tgctcagtat cttcagcagt gtccatttga agatcatgta 120 

aaattagtga atgaagtaac tgaatttgca aaaacatgtg ttgctgatga gtcagctgaa 180 

aattgtgaca aatcacttca tacccttttt ggagacaaat tatgcacagt tgcaactctt 240 

cgtgaaacct atggtgaaat ggctgactgc tgtgcaaaac aagaacctga gagaaatgaa 300 

tgcttcttgc aacacaaaga tgacaaccca aacctccccc gattggtgag accagaggtt 360 

gatgtgatgt gcactgcttt tcatgacaat gaagagacat ttttgaaaaa atacttatat 420 

gaaattgcca gaagacatxc ttacttttat gccccggaac tccttttctt tgctaaaagg 480 

tataaagctg cttttacaga atgttgccaa gctgctgata aagctgcctg cctgttgcca 540 

aagctcgatg aacttcggga tgaagggaag gcttcgtctg ccaaacagag actcaagtgt 600 



465 



470 




480 



51 
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gccagtctcc aaaaatttgg agaaagagct ttcaaagcat gggcagtagc tcgcctgagc 660 

cagagatttc ccaaagctga gtttgcagaa gtttccaagt tagtgacaga tcttaccaaa 720 

gtccacacgg aatgctgcca tggagatctg ettgaatgtg ctgatgacag ggcggacctt 780 

gccaagtata tctgtgaaaa tcaagattcg atctccagta aactgaagga atgctgtgaa 840 

aaacctctgt tggaaaaatc ecactgeatt gccgaagtgg aaaatgatga gatgcctgct 900 

gacttgcctt cattagctgc tgattttgtt gaaagtaagg atgtttgcaa aaactatgct 960 

gaggcaaagg atgtcttcct gggcatgttt ttgtatgaat atgcaagaag gcatcctgat 1020 

tactctgtcg tgctgctgct gagacttgcc aagacatatg aaaccactct agagaagtgc 1080 

tgtgecgctg cagatcctca tgaatgctat gccaaagtgt tcgatgaatt taaacctctt 1140 

gtggaagagc ctcagaattt aatcaaacaa aattgtgagc , tttttgagca gcttggagag 1200 

tacaaattr.c agaatgcgct attagttcgt tacaccaaga aagtacccca agtgtxaact 1260 

ccaactcttg tagaggtctc aagaaaccta ggaaaagtgg gcagcaaatg ttgtaaacat 1320 

cctgaagcaa aaagaatgcc ctgtgcagaa gactatctat ccgtggtcct gaaccagtta 1380 

tgtgtgttgc atgagaaaac gccagtaagt gacagagtca ccaaatgctg cacagaatcc 1440 

ttggtgaaca ggcgaccatg cttttcagct ctggaagtcg atgaaacata cgttcccaaa 1500 

gagtttaatg ctgaaacatt caccttccat gcagatatat gcacactttc tgagaaggag 1560 

agacaaatca agaaacaaac. tgcacttgtt gagctcgtga aacacaagcc caaggcaaca 1620 

aaagagcaac tgaaagctgt tatggatgat ttcgcagctt ttgtagagaa gtgctgcaag 1680 

gctgacgata aggagacctg ctttgccgag gagggtaaaa aacttgttgc tgcaagtcaa 1740 

gctgccttag gcttataatg ac 1762 
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